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it's a dirty theory... You have a theory which is fundamentally ambiguous. " - Bell 

"How wonderful that we have met with a paradox. Now we have some hope of making 
progress. " - Bohr 



© Eric G. Cavalcanti, 2007 



To Shana, who always believed, for her love and support. 



Statement of Originality 



I declare that the work presented in the thesis is, to the best of my knowledge and belief, 
original and my own work, except as acknowledged in the text or in the Statement of 
Contribution to Jointly-Published Work below, and that the material has not been 
submitted, either in whole or in part, for a degree at this or any other university. 

Statement of Contribution to Jointly-Published Work 

Some of the content of this Thesis is adapted from papers published or submitted for 
publication jointly with other authors. The extent of contribution of other authors is 
detailed in the following. 

Section 3.4 is adapted from reference 5 of the List of Publications. The work reproduced 
in this section was accomplished by me under the guidance and supervision of Dr. 
Margaret Reid. 

Chapter 4 is a reproduction, with some adaptations, of reference 3. Most of the work in 
this chapter was accomplished by me with the guidance and supervision of Prof. Peter 
Drummond. The detector inefficiency calculation in 4.4.2 was done by Margaret Reid. 
The proofs of Section 4.5 were performed in collaboration by Chris Foster and me. 

Chapter 5 is a reproduction, with some adaptations, of reference 6. Most of the work in 
this chapter was accomplished by me with the guidance and supervision of Dr. Margaret 
Reid. The work in Section 5.7 and figures 5.1-5.7 and 5.9 were done by Margaret Reid. 



(Eric G. Cavalcanti, Candidate) 



(Peter D. Drummond, Principal Advisor) 



Acknowledgements 



So many people contributed, directly or indirectly, to the realisation of this thesis that 
it is a daunting task to acknowledge them all. So before going any further, let me first 
say that the following is not at all an exhaustive attempt. 

First of all I would like to thank Halina Rubinsztein-Dunlop. If it was not for her I 
would not be here today, as she was my first contact in the University of Queensland and 
officially my initial supervisor. Halina supported my decision to change from experiment 
to theory and helped me find a suitable supervisor, always with my best interest at heart. 
I am deeply thankful for that. 

A good thesis supervisor must find a fine balance between suggesting problems to and 
motivating their students and allowing them to pursue their own interests. It was with 
great satisfaction that I found that my thesis supervisors, Peter Drummond, Margaret 
Reid and Karen Kheruntsyan, have found just such perfect balance. I never had a lack of 
interesting and stimulating problems to work on, due to their input and suggestions, but 
also felt that I always had enough time to think about other problems and complement 
my studies with intellectual freedom. This had a great impact in how my career and 
interests have developed and I can't thank Peter, Margaret and Karen enough for that. 

Another group of people had an impact on my PhD just as important as that of my 
supervisors, and certainly more frequent. Those were my office colleagues and now 
friends (in alphabetical order so none of them feels less valued), the "usual suspects" 
Andy, Chris, Paulo and Yeong-Cherng. They were always ready to listen to my philo- 
sophical ramblings and keen to help with my mathematical difficulties (some of these 
even lead to collaborations). I will sincerely miss the friendly, supportive and relaxed 
enviroment we maintained over these years. 

Beyond that office there were many people, among friends, colleagues and helpful staff, 
who made my PhD the rich and memorable experience that it was. I would particularly 
like to thank Howard Wiseman for sharing his knowledge, reading this manuscript and 
suggesting corrections, and for giving me a job. But to name them all would take much 
more space than I can fit in this page. You know who you are, and I thank you all 
wholeheartedly for being there. 



List of Publications 



The following is a list of the authors' publications which are related to the theme of 
this thesis, in chronological order. 



1. M. D. Reid and E. G. Cavalcanti, Macroscopic quantum Schrodinger and Einstein- 
Podolsky-Rosen paradoxes, Journal of Modern Optics 52, 2245 (2005). 

2. E. G. Cavalcanti and M. D. Reid, Signatures for generalized macroscopic super- 
positions, Physical Review Letters 97, 170405 (2006). 

3. E. G. Cavalcanti, C. J. Foster, M. D. Reid and P. D. Drummond, Bell inequalities 
for continuous-variables correlations, Physical Review Letters 99, 210405 (2007). 

4. E. G. Cavalcanti and M. D. Reid, Uncertainty relations for the realization of 
macroscopic quantum superpositions and EPR paradoxes, Journal of Modern Op- 
tics 54, 2373 (2007). 

5. E. G. Cavalcanti and M.D. Reid, Criteria for generalized macroscopic and S-scopic 
quantum superpositions, accepted for publication on Physical Review A (2008). 

6. E. G. Cavalcanti, M. D. Reid, P. D. Drummond and H. A. Bachor, Unambiguous 
signatures of entanglement and Bohm 's spin EPR paradox, larXiv:0711. 37981 



Abstract 



In recent decades there has been a resurge of interest in the foundations of quantum the- 
ory, partly motivated by new experimental techniques, partly by the emerging field of 
quantum information science. Old questions, asked since the seminal article by Einstein, 
Podolsky and Rosen (EPR), are being revisited. The work of John Bell has changed the 
direction of investigation by recognising that those fundamental philosophical questions 
can have, after all, input from experiment. Abner Shimony has aptly termed this new 
field of enquiry experimental metaphysics. The objective of this Thesis is to contribute 
to that body of research, by formalising old concepts, proposing new ones, and finding 
new results in well-studied areas. Without losing from sight that the appeal of exper- 
imental metaphysics comes from the adjective, every major result is followed by clear 
experimental proposals with detailed analysis of feasibility for quantum-atom optical 
setups. 

After setting the appropriate terminology and the basic concepts, we will start by 
analysing the original argument of Einstein, Podolsky and Rosen. We propose a general 
mathematical form for the assumptions behind the EPR argument, namely those of local 
causality and completeness. That formalisation entails what was termed a Local Hidden 
State model by Wiseman et ai, which was proposed as a formalisation of the concept of 
steering first introduced by Schrodinger in a reply to the EPR paper. Violation of any 
consequences that can be derived from the assumption of that model therefore implies 
a demonstration of the EPR paradox. We will show how one can then re-derive the 
well-known EPR-Reid criterion for continuous-variables correlations, and derive new 
ones applicable to the spin setting considered by Bohm. 

The spin set-up of the EPR-Bohm paradox was used by Bell to derive his now famous 
theorem demonstrating the incompatibility of the assumption of local causality and the 
predictions of quantum mechanics. The inequalities which bear his name can be derived 
for any number of discrete outcomes, but so far there has been no derivation which 
can be directly applied to the continuous-variables case of the original EPR paradox. 
We close the circle by deriving a class of inequalities which make no explicit mention 
about the number of outcomes of the experiments involved, and can therefore be used 
in continuous-variables measurements with no need for binning the continuous results 
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into discrete ones. Apart from that intrinsic interest, these inequalities could prove 
important as a means to perform an unambiguous test of Bell inequalities since optical 
homodyne detection can be performed with high detection efficiency. The technique, 
which is based on a simple variance inequality, can also be used to re-derive a large class 
of well-known Bell-type inequalities and at the same time find their quantum bound, 
making explicit from a formal point of view that the non-commutativity of the local 
operators is at the heart of the quantum violations. 

Finally, we address the issue of macroscopic superpositions originally sparked by the 
infamous "cat paradox" of Schrodinger. We consider macroscopic, mesoscopic and 
'S'-scopic' quantum superpositions of eigenstates of an observable, and develop some 
signatures for their existence. We define the extent, or size 5" of a (pure-state) su- 
perposition, with respect to an observable X, as being the maximum difference in the 
outcomes of X predicted by that superposition. Such superpositions are referred to 
as generalised 5'-scopic superpositions to distinguish them from the extreme superpo- 
sitions that superpose only the two states that have a difference S in their prediction 
for the observable. We also consider generalised S'-scopic superpositions of coherent 
states. We explore the constraints that are placed on the statistics if we suppose a 
system to be described by mixtures of superpositions that are restricted in size. In this 
way we arrive at experimental criteria that are sufficient to deduce the existence of a 
generalised ^-scopic superposition. The signatures developed are useful where one is 
able to demonstrate a degree of squeezing. 
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Chapter 1 
Introduction 



In a recent book by Lee Smolin, that author proposed a list of the 5 greatest problems 
in contemporary physics. In second place, just after the problem of quantum gravity, 
were the foundational problems of quantum mechanics. Below this were the problems 
of unification of particles and forces, of explaining the free constants of the standard 
model and the problem of dark matter and dark energy. 

The prominent position may sound quaint for those who have been taught that the prob- 
lems in the quantum foundations were all solved many years ago by Bohr, Heisenberg, 
von Neumann and the other founders of the theory. That impression is especially un- 
derstandable given the enormous empirical success of the theory. However, a large part 
of the community is starting to recognise that the problems that Einstein, Schrodinger 
and others have raised since the theory's beginnings are as relevant and urgent as ever. 

Two reasons may be advanced as prime contributors to this increased interest in the 
quantum foundations. Firstly, the emergence of the field of quantum information and 
computation, which aims to harness the quantum nature of the world for previously 
impossible tasks, has raised physicists' awareness for the foundational problems by 
exposing a larger audience to the bizarre nature of quantum phenomena. 

Secondly, some theorists such as Smolin are starting to suspect that the failures to find 
a quantum theory of gravity may be related to our failure of understanding quantum 
mechanics. Success in the first of the above problems, to those authors, will have to 
come hand in hand with success in the second. 

This does not at all mean that these authors advocate a return to Einstein's dream of a 
local and realist theory. Since Bell's famous 1964 theorem, and the many experiments 
that confirm violation of Bell inequalities, we know this is a hopeless goa|l|. But fol- 
lowing the quote from the same Bell on page EH there are professional problems with 

^Although, strictly speaking, there are some open loopholes in all violations of Bell's inequalities 
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quantum mechanics. Those like Bell who point out the contradictions within the theory, 
most notably those arising out of the so-called measurement problem, are not merely 
indicating that quantum theory is not locally realistic — no-one was more aware of that 
than Bell! Those theorists are pointing out that we ought to have a coherent picture 
of the whole of reality, not just of experimentalists' laboratory fiddlings, even if that 
picture turns out to be quite distinct from any classical one. 

The return to a more professional attitude, in my opinion, will have to include a more 
careful attention to philosophical issues. Philosophers have since long battled with 
the conceptual quandaries which quantum mechanics forces us to face. A professional 
attitude towards the quantum questions cannot avoid using terms such as ontology and 
epistemology. Even if Bohr is right and quantum mechanics regards not Nature herself, 
but regards what we can say (and therefore what we can know) about Nature, we 
should be able to say clearly what we in this sentence means, and we should be able to 
understand how our knowledge seems to follow well-defined physical laws. 

The present situation with quantum mechanics could be compared to the situation of the 
Special Theory of Relativity before Einstein interpreted the Lorentz transformations. 
Eintein's revolution was one of interpretation, and it lead to a revolution in how we 
would come to understand and use the theory of relativity. It is also interesting to 
conjecture about what would happen with the General Theory if the first breakthrough 
of interpretation achieved by the Special Theory were not laid down. With historical 
hindsight it is easy to see the value of Einstein's interpretational leap. However, before 
1905 it was unthinkable that the solution to the problem of the electrodynamics of 
moving bodies would lead to such a deep restructuring of our basic fundamental notions 
about space and time. 

Similarly, one could argue that we are in a similar pre-revolutionary phase with respect 
to quantum mechanics (not necessarily in the sense that the revolutionary leap is im- 
minent, but that it is yet to come). And pointing out the current empirical successes of 
the theory is even more of a reason to pursue its foundational problems. If the present 
quagmire of postulates and quantisation rules is so successful, one can only dream of 
what we could achieve with a satisfactory understanding. 



CS78I . I gIsOTI I. It seems unlikely that local realism will be restored when those loopholes are closed, but 
it is of fundamental importance to be able to settle the issue once and for all. We will have something 
to say about that in Chapter HI 
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1.1 "Experimental Metaphysics" 

To understand the source of the conflicts in the foundations of Quantum Mechanics, it 
is essential to know where and how our classical models and intuitions start to fail to 
describe a quantum world. This is the su bject o f experimental metaphysics. The term 
was originally coined by Abner Shimony [shisd l to describe the new area of enquiry 
opened by Bell in 1964 when he recognised the existence of experimentally testable 
implications that could be derived from some general metaphysical assumptions — 
namely, those that go under the rubric of local realism^. For the flrst time, it was clearly 
recognised that very general philosophical theses could have input from experiment. 

At the time of Bell those questions were not part of the concerns of most physicists, but 
today we have learned to perceive the nonlocality evidenced by Bell as a resource. The 
fields of quantum information and computation rely on these counter-intuitive features 
of Quantum Mechanics for speeding up computational tasks or achieving results — 
such as unconditionally secure quantum cryptography — impossible to achieve before. 
It therefore becomes an important task to map those resources and recognise how 
exactly they are distinct from classical resources. This is another problem towards 
which Experimental Metaphysics can contribute. 

The purpose of this thesis is to contribute to that body of research, by formalising old 
concepts, proposing new ones, and finding new results in well-studied areas. Without 
losing from sight that the appeal of experimental metaphysics comes from the adjective, 
every major result is followed by clear experimental proposals with detailed analysis of 
feasibility for quantum-atom optical setups. 



1.2 Outline of the Thesis 

In Chapter [2] we set up the appropriate terminology and the basic concepts. Most of 
it will be simply careful definitions of standard concepts, but some definitions may be 
new and some results and consequences may not have been fully appreciated before. In 
particular, I present a new result on a relation between signal locality and the irreducible 
unpredictability of Nature. 

Chapter [3] is related to publications 4 and 6 of the List of Publications. In that cha pter, 



EPR35I. and 



we analyse the original argument of Einstein, Podolsky and Rosen (EPR) 
propose a general mathematical form for the assumptions behind that argument, namely 
those of local causality and completeness of quantum theory. That will entail what was 



^To use standard terminology. A more careful nomenclature will be introduced in Chapter [2l 
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termed a Local Hidden State model by Wiseman et al. |WJD07l |. which was propo sed 
as a formalisation of the concept of steering first introduced by Schrodinger |Sch35l | in 
a reply to the EPR paper. Violation of any consequences that can be derived from 
the assumption of that model therefore implies a demonstration of the EPR paradox. 



We will show how one can then re-derive the well-known EPR-Reid criterion iReiS 



for continuous- variab 
considered by Bohm 



es corr elations, and derive new ones applicable to the spin setting 



BoMll 



The spin set-up of the EPR-Bohm paradox was used by Bell |Bel64l | to derive his now 
famous theorem demonstrating the incompatibility of the assumption of local causality 
and the predictions of quantum mechanics. The inequalities which bear his name can 
be derived for any number of discrete outcomes, but so far there has been no deriva- 
tion which can be directly applied to the continuous- variables case of the original EPR 
paradox. In Chapter [U related to publication 3, we close the circle by deriving a class 
of inequalities which make no explicit mention about the number of outcomes of the 
experiments involved, and can therefore be used in continuous-variables measurements 
with no need for binning the continuous results into discrete ones. Apart from that 
intrinsic interest, these inequalities could prove important as a means to perform an 
unambiguous test of Bell inequalities which does not suffer from the logical loopholes 
CS78 . IgIsOtI that plague all experimental demonstrations so far, since optical homo- 
dyne detection can be performed with high detection efficiency. The technique, which 
is based on a simple variance inequality, can also be used to re-derive a large class 
of well-known Bell-type inequalities and at the same time find their quantum bound, 
making explicit from a formal point of view that the non-commutativity of the local 
operators is at the heart of the quantum violations. 

Finally, in Chapter [5l related to publications 1, 2, 4 and 5, we address the issue of macro- 
scopic superpositions originally sparked by the infamous "cat paradox" of Schrodinger 



Sch35|, presented in the same seminal paper where he coined the terms entanglement 
and steering. We consider macroscopic, mesoscopic and '5'-scopic' quantum superposi- 
tions of eigenstates of an observable, and develop some signatures for their existence. 
We define the extent, or size 5" of a (pure-state) superposition, with respect to an ob- 
servable X, as being the maximum difference in the outcomes of X predicted by that 
superposition. Such superpositions are referred to as generalised S'-scopic superposi- 
tions to distinguish them from the extreme superpositions that superpose only the two 
states that have a difference S in their prediction for the observable. We also consider 
generalised ^-scopic superpositions of coherent states. We explore the constraints that 
are placed on the statistics if we suppose a system to be described by mixtures of su- 
perpositions that are restricted in size. In this way we arrive at experimental criteria 
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that are sufficient to deduce the existence of a generalised ^-scopic superposition. The 
signatures developed are useful where one is able to demonstrate a degree of squeezing. 
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Chapter 2 

Concepts of Experimental Metaphysics 



In June 2007, in a conference on Quantum Foundations in the charming little town of 
Vaxjo, Sweden, dedicated to the 80*'^anniversary of the Copenhagen Interpretation, I 
have noticed an unexpectedly large number of debates about what experimental viola- 
tions of Bell inequalities prove. I was definitely expecting debates about, say, how to 
make sense of a world where Bell inequalities are violated, but not as much about what 
they mean in the first place. It became clear to me that the reason behind many of the 
disagreements (though I wouldn't say all of them) was the lack of common ground in 
definitions of terms such as locality or realism and in the distinction between models 
and the phenomena they predict. 

Local realism is the catch-all term that is usually employed to represent the set of as- 
sumptions which Bell's theorem shows to be incompatible with the quantum mechanical 
predictions and (up to some open loopholes) violated by Nature. Even though the final 
mathematical form of the constraints imposed by local realism is quite uncontroversial, 
there are numerous authors who debate what exactly the underlying assumptions cor- 
respond to and what features of our world view must be modified to accommodate the 
violation of Bell inequalities. See for example the collection of pape rs edited by J. T. 



Gushing and E. McMullin in |CM89l | and the book of Tim Maudlin |Mau94l | for some 
in-depth discussion of these issues. I won't attempt to go as deeply into the myriad 
questions that can be addressed in the philosophical surroundings of Bell's theorem. 
My main purpose is to establish as clearly as possible the terminology I will use. 

That said, in this chapter I will introduce some new usage of terms and some impli- 
cations which may not have been fully recognised before. While some of it will be my 
own work, much of my current understanding of these concepts is due to insights gained 
from discussions with Howard Wiseman, to whom I am grateful. The presentation style 
and most definitions were infiuenced by notes from that author. 
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Concepts of Experimental Metaphysics 



The most important new result of my own will be an interesting connection between 
the assumption of signal locality, or no-faster-than-light-signalling, and the notion of 
predictabilty. I will show that the assumption of signal locality - which must be satisfied 
if one assumes relativistic invariance - together with the experimental observation of 
violation of Bell inequalities, lead to the conclusion that Nature is irreducibly unpre- 
dictable, quite independently of anything from the formalism of quantum mechanics. 
This establishes a deep connection between two of the main puzzles of quantum me- 
chanics: Bell-nonlocality and the Uncertainty Principle. 



2.1 The Minimal Realist-Relativistic Framework 



Many of the debates around the meaning of the Bell theorem regard the status of the 
word 'realism' in 'local realism'. In a recent analysis, Norsen |Nor07al | has argued that 
there's no such assumption among those that go into a derivation of a Bell inequality, 
or at least that any such assumption is so fundamental that no scientific theory can 
be built without it. I would not go as far as saying that there's no such assumption, 
although I agree with that author that there are misconceptions around the term (and 
I will try to clear some of them here) and that it is important to recognise that the 
assumption of realism is part of an underlying framework without which Bell's concept 
of local causality cannot even be expressed. It is not possible, as we will discuss in 
more detail, to maintain local causality while rejecting realism. There are, however, 
other possible usages of the word 'locality' which are possible to be maintained even 
in light of Bell's theorem, most notably the concept of no faster-than-light signalling 
or signal locality. However, those are emphatically not the local causality of Bel|l|, and 
one does not need to reject realism to keep signal locality. 

So let us attempt to understand what 'realism' in 'local realism' can possibly mean. 
What better source for that than the most notable supporter of realism in the 20**^ 
century? Einstein's concept of Reality is clearly expressed in the following passage: 



"If one asks what, irrespective of quantum mechanics, is characteristic of 
the world of ideas of physics, one is first of all struck by the following: the 
concepts of physics relate to a real outside world... It is further characteristic 
of these physical objects that they are thought of as arranged in a space-time 
continuum. An essential aspect of this arrangement of things in physics is 
that they lay claim, at a certain time, to an existence independent of one 

^Besides, the concept of signalling is not entirely without conceptual difficulties as we'll see later in 
this chapter. 
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a nother , provided these objects 'are situated in different parts of space'. 



1 ft 



([BorTll, Pg. 168) 



I will encapsulate in the following axiom a weakened form of the concept of Reality 
expressed by Einstein in the above quote. 

Axiom 1 (Reality): Events occur independently of observers or frames of 
reference. 

That is, this is the assumption that the very fact that events occur is something in- 
dependent of observers or reference frames. A pebble being splashed by water on a 
deserted beach, a photo-detector signalling the arrival of a photon, the Big Bang, Ax- 
iom 1 states that the reality of those events is independent of anyone's description or 
observation. 

That's not to say that different observers can't give different accounts about where or 
when those events occurred; it's just that they will all agree, if Axiom 1 is true, that 
they in fact occurrecj^. That does not mean, of course, that all physical quantities are 
independent of observers or reference frames - some physical quantities such as velocity 
are patently relative in that way. 

Axiom 1 does not necessarily imply that there are hidden variables underlying quantum 
properties either; one could regard the interactions between quantum systems and their 
evolutions before measurement simply as not corresponding to bona-fide 'events' in the 
above sense (maybe 'virtual events' would be an appropriate term for such interactions, 
if they are regarded simply as mathematical devices) . In fact this seems to be an accu- 
rate representation of the view most commonly espoused by physicists in the orthodox 
camp, clearly expressed in Wheeler's famous maxim "No phenomenon is a phenomenon 
until i t is an observed phenomenon" or by Peres' "Unperformed experiments have no re- 
sult s" |per74 Neither does this axiom imply that all events are human-scale observable 
events; it is important to allow for the possibility that a theory will make claims about 
some events even if we can't directly know them. We need to emphasise, however, that 
one can conceive of frameworks in which this axiom does not hold, and I will return to 
this point in 12.6.31 

The other important assumption in Einstein's concept is that "these physical objects 
(...) are thought of as arranged in a space-time continuum", which I represent in the 
following axiom. 



^This is actually not a sufficient condition. It is possible that no observers disagree about the 
reality of events, and yet Axiom 1 to be false. This would only require that observers only be able to 
communicate with just those observers who would agree with them. 
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Axiom 2 (Space-time): All events can be embedded as points in a single 
relativistic space-time, where the concepts of space-like separation, light 
cones, and other standard relativistic concepts can be applied unambigu- 
ously. 

The conjunction of Axioms 1 and 2 constitute what I will call a Minimal Realist- 
Relativistic Framework (MRRF). All of the subsequent notions and theorems will as- 
sume this framework. 

While the above was stated as an ontological framework (i.e., as assumptions about how 
the world is), one can re- frame it as an epistemological framework (i.e., as assumptions 
about what one can know). So Axiom 1 could be rephrased as saying that all observers 
agree about which events they see and Axiom 2 as stating that all observers can describe 
those events consistently as corresponding to points in a single space-time. Any meta- 
physical realist (one who believes that there exists a world independent of observers) 
would take the fact that all observers agree about which events occur as evidence that 
they really occur, independently of any observers. However, even a metaphysical anti- 
realist (one who does not believe that there exists a world independent of observers) 
could accept those axioms in this weaker epistemic form, and that's the reason I include 
them here — so that anti-realists don't feel completely secure against the consequences 
of Bell's theorem. 



2.2 Causation and Bell's local causality 



Maybe surprisingly to most physicists, causation is a problematic concept in philosophy 



even for reasons independent of Bell. I won't delve into those difficulties here 
stick to the concepts which are relevant for the current purposes. 



but will 



That Bell takes his theorem as implying a conflict between quantum mechanics and 
relativity is evident in the following^: 



"For me then this is the real problem with quantum theory: the apparently 
essential conflict between any sharp formulation and fundamental relativity. 
That is to say, we have an apparent incompatibility, at the de epest l evel, 
between the two fundamental pillars of contemporary theory..." ( 
172) 



Bel8 



For a thorough philosophical treatment of causation see DowOd | . 



For a more in-depth analysis of Bell's concept than I will present here, see [Nor07b | 
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We will come soon to a derivation of this "essential conflict". But for that we need to 
understand the concept of local causality Bell takes relativity to imply. In "La nouvelle 
cuisine", Bell starts with an amusing quote from H.B.G. Casimir: 



"I want to boil and egg. I put the egg into boiling water and I set an 
alarm for flve minutes. Five minutes later the alarm rings and the egg 
is done. Now the alarm clock has been running according to the laws of 
classical mechanics uninfluenced by what happened to the egg. And the egg 
is coagulating according to laws of physical chemistry and is uninfluenced by 
the running of the clock. Yet the coincidence of these two unrelated causal 
happenings is meaningful, because, I, the great chef, imposed a structure in 
my kitchen." ([Bel87|, pg. 232) 



This passage is to illustrate a principle that has always been a hallmark of the scientific 
enterprise: whenever correlations between events occur, either one event causes the 
other or they share a common cause. This is known in philosophy of science as the 
principle of common cause, which was first formulated in a clear manner by Hans 
Reichenbach Arn05l |. Bell is thinking along very similar lines when he describes the 



relativistic principle of local causality as (with reference to Figure 12. ip : 



"The direct causes (and effects) of events are near by, and even the indirect 
causes (and effects) are no further away than permitted by the velocity of 
light. Thus for events in a space-time region 1 (...) we would look for causes 
in the backward light cone, and for effects in the future light cone. In a 
region like 2, space-like separated from 1, we would seek neither causes nor 
effects of events in 1. Of course this does not mean that events in 1 and 
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Figure 2.2: "Full specification of what happens in 3 makes events in 2 irrelevant for pre 



dictions about 1 in a locally causal theory." (reproduced with permission from |Bel87 
pg. 240. Copyright Cambridge University Press.) 

2 might not be correlated, as are the ringing of Professor Casimir's alarm 
and the re adiness of his egg. They are two separate results of his previous 



actions." 



Bel87|, pg. 239) 



The above principle, Bell admits, "is not yet sufficiently sharp and clean for mathe- 
matics." He then considers the following as an implication of the principle above (with 
reference to Figure [2?2l) : 

"A theory will be said to be locally causal if the probabilities attached to 
values of local beables in a space-time region 1 are unaltered by specification 
of values of local beables in a space-like separated region 2, when what 
happens in the backward light cone of 1 is already sufficiently specified, for 
e xampl e by a full specification of local beables in a space-time region 3..." 
dseisj pg. 239) 



The concept of "beable" was invented by Bell to contrast with the notion of "observable" 
that is fundamental in orthodox quantum mechanics: 

"The beables of the theory are those elements which might correspond to 
elements of reality, to things which exist. Their existence does not depend 
on 'observation'. Indeed observation and observers must be made out of 



beables." ((Bel87|, pg. 174) 



The above and the specification in Bell's formulation of local causality that the rele- 
vant beables are "local beables" are evidence that what he means by 'local beables' 
is precisely what I meant by 'events' in the previous section. I will stick to the latter 
terminology as it seems very non-problematic and familiar to physicists, who encounter 
the term with precisely this meaning in any undergraduate course in relativity. 

We are now ready to formulate the principle mathematically: 
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Definition 1 (Local causality): A theory will be said to be locally causal 
iff for every pair of events Ei, E2 respectively contained in space-like 
separated regions 1 and 2, the probability, posited by the theory, of oc- 
currence of event El is independent of E2, given the specification of some 
sufficient set of events Spi in the past light cone of 1, i.e., 

P{Ei\E2,Spi)=P{Ei\Spi). (2.1) 

It's important to emphasise the need only for a sufficient set of events (and not neces- 
sarily the full specification of all events in the past light cone), for a reason which will 
be clear in the next section. What counts as a sufficient set of events will, of course, 
depend on the theory. The set of events in region 3 of Figure 12.21 would be sufficient in 
a theory where all causal chains are continuous. However, one can envisage theories in 
which that does not occui@. This matter will not be too important for Bell's theorem 
as the purpose of that will be to show that no such set can exist anywhere in the past 
light cone of Ei and therefore that Local Causality fails. 



2.3 Other general definitions 

The previous Axioms and definitions were quite independent of any particular setup. 
To derive specific consequences of local causality we need to introduce some concrete 
experimental situation. The most common setting used in discussions of Bell's theorem 
is constituted by two parties, traditionally called Alice and Bob, and this is the scheme 
of Figure 12.31 The definitions and theorems of this chapter will make use of that case 
only, but most of them could be generalised in an obvious way for arbitrary parties. 

• Alice and Bob are two spatially separated observers who can perform a number 
of measurements and observe their outcomes. 

• For each pair of systems they perform measurements upon, the choices of measure- 
ment settings and their respective outcomes occur in regions which are space-like 
separated from each other, so that no signal travelling at a speed less than or 
equal to that of light could connect any two of them; 

^For example, if a theory (as is the case in orthodox quantum theory) considers as bona-fide events 
the preparation of a quantum system in a region R and the outcomes of measurements done on that 
system in a region 1 in the future light cone of R, but regards the intermediate evolution of the system 
as not corresponding to events in the sense of Axiom 1, one could setup a case in which the set of 
events in region 3 of Figure 12.21 would not be sufficient to screen off events in 1 from events in 2 even 
in the absence of entanglement. That, however, should arguably not be considered as a failure of local 
causality. 
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Figure 2.3: The bipartite experimental setup used fo r discussion of the concepts of this 



chapter, (reproduced with permission from [Bel87|, pg. 242. Copyright Cambridge 
University Press.) 

• For each pair of systems, we will denote by a and b Alice's and Bob's respective 
measurement settings, and by A and B their corresponding observed outcomes. 

• Each pair of systems is prepared by an agreed-upon reproducible procedure c. 
The events corresponding to this preparation procedure are necessarily in the 
intersection of the past light cones of the measurements yielding A and B; 

• a, b, A, B and c represent events in the sense of the MRRF; 

• A represents any further variables (in addition to a, b and c) that may be relevant 
to the outcomes of the measurements considered, that is, conditioned upon which 
the probabilities of the experimental outcomes may be further specified. They 
are not fully determined by the preparation procedure c, and as such may be 
deemed "hidden variables". They are necessarily not known in advance, since 
any knowledge of additional variables could be assimilated into the preparation 
c. Conversely, if some of the variables associated with the preparation procedure 
are unknown, they must be assimilated into A. In other words, the distinction 
between c and A is merely epistemic, i.e., c represents the set of known relevant 
variables and A represents the set of unknown (but not necessarily unknowable) 
relevant variables. Strictly speaking A describes a set of events in the union of 
the past light cones of the experiments as needed by Definition 1. However, one 
can also think of them as physical variables which are specified by those events; 

• When an equation involving variables appears, it is to be understood that the 
equality holds for all values of those variables. 

We can now specify the third axiom needed for Bell's theorem. 



Axiom 3 (External Conditionalisation, or Free will, or No backwards- 
causality): The choices of experiment a, b, can be conditioned on free 
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variables uncorrelated with X, such that knowledge of those choices does 
not provide any further information about the hidden variables, i.e., it 
does not change their probability distribution. Formally, 

P{X\a,b,c) = P{X\c). (2.2) 



In other words, the choices can be freely made, independently of any relevant variables 
that influence the outcomes of the measurements under study. This is a fundamental 
requirement of any theory in which it is possible to separate the world into the system of 
interest and the rest of the world, where the rest of the world can be ignored as irrelevant 
to the evolution of the system. Some people may picture that as an allowance for the 
experimenters to make those choices at their own free will. This picture introduces 
the inconvenience of requiring us to explain what we mean by "free will", which is 
completely besides the point. Even if the world is completely deterministic and human 
free will is an illusion, External Conditionalisation would be an almost unavoidable 
assumption of any physical theory. 

To make that clear, one can imagine that those choices are made by pointing photo- 
detectors at opposite parts of the sky and deciding based on fluctuations of the cosmic 
microwave background radiation; or that they depend on the output of a pseudo-random 
number generator; or that they are decided at the whim of an experimentalist; or by 
some further random quantum process, say, a measurement on a (presumably) uncorre- 
lated spin-1/2 particle; or by any combination of those processes. This is where we see 
the importance of being clear about the need for the specification of only a sufficient 
set of events Spi in the Definition 1. Of course the choices of experiment will depend on 
some events in the past light cone of the events under study, but a "superdeterminis- 
tic" theory would be needed to entertain the possibility that the factors on which those 
choices depend can infiuence the evolution of the system under study. In such theory, 
any possible variable which one chooses to conditionalise the choices of experiment on 
would be statistically correlated with the set of hidden variables which are relevant to 
the experiments of Alice and Bob. And they would need to be conspiratorially cor- 
related in such a way as to fool us into believing that local causality is violated by 
the correlations between those experiments while in reality the world is strictly locally 
causal. 

That said, a possible way in which equation (12. 2p could be vio lated is by way of 
backwards-in-time causality or retrocausality. Some authors |Pri96l | have claimed that 
in fact such backwards causality should be expected from the fact that fundamental 
physics is (mostly) time-symmetric, and could be the source of quantum nonlocality. 
This view, however, is far from being generally accepted. 
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Definition 2: A phenomenon is defined, for a given preparation procedure 
c, by the relative frequencies 



fiA,B\a,b,c). 



(2.3) 



for all measurements a, b, and corresponding outcomes A, B. 



The use of frequencies in that definition, instead oi probabilities, is motivated by the fact 
that it is rather uncontroversial that frequencies are the things we observe, at least to 
an arbitrarily good approximation. Whether or not frequencies directly correspond to 
probabilities will depend on one's interpretation of probabilities. Here we will make the 
distinction between the observable frequencies (the phenomenon) and the probabilities 
which are employed to predict or explain the phenomenon (the model). In general, 
models can make use of unobserved or even unobservable variables. Nevertheless, those 
variables can be taken to have an existence independent of observers, i.e., they can be 
taken to have an ontological status. Thus one can refer to such models as ontological 
models. The following is the most general kind of ontological model for the phenomena 
under consideration, in which the observed phenomena can be explained as arising out 
of our ignorance of underlying variables, where our ignorance is accounted for with 
standard probability theory, and where Axiom 3 holds. 

Definition 3: An ontological model (or model in short) for a phe- 



Nothing in the following discussions hangs on whether the A are discrete or continuous, 
so for simplicity I use discrete hidden variables. The above definition can be extended 
to a continuous set of hidden variables in the standard way. 

Definition 4: An operational theory (OT) is the class of trivial mod- 




P{A,B\a,b,c,X) (2.4) 



which predicts the phenomenon 




(2.5) 



els, i.e., the class of models for which 



P{A,B, \a,b, c. 



X) = f{A,B\a,b,c). 



(2.6) 
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That is, in operational theories no hidden variables further specify the probabilities. 
One could argue that an operational theory contains no A's, but this definition, as a 
class of models with trivial dependences on A, allows an operational theorist to talk 
about the A's even if they are not operationally meaningful. 

Definition 5: A hidden variable model (HVM) is any model that is not 
trivial. 

We will now determine what the Definition 1 of local causality implies to this experi- 
mental situation. Since a and A are space-like separated from b and B, and the set of 
hidden variables completely specifies all relevant events in the past light cone of a, A 
and b, B, we obtain 

Corollary 1: A model is locally causal, i.e., a model satisfies local 
causality (LC) iff 

P{A\a,b,B,c,X) = P{A\a,c,X), (2.7) 

plus the corresponding equations for B. 

Definition 6: A model is said to be deterministic, or to satisfy deter- 
minism (D), iff 

P{A,B\a,b,c,\) E {0,1}. (2.8) 

This implies that A and B are functions of the variables which condition those proba- 
bilities, i.e., 

A = A{a, b,c,\), B = B{a, b, c. A). (2.9) 

Definition 7: A model is said to be predictable, or to satisfy pre- 
dictability (P) iff 

P{A, B\a, b, c. A) = P{A, B\a, b, c) E {0, 1}. (2.10) 

That is, a model is predictable iff it is trivial and deterministic. This implies that A 
and B are functions of 

A = A{a, b,c), B = B{a, b, c). (2.11) 

This definition is motivated by the fact that the variables represented by c are known. 
The former definition (determinism) is ontological (about how the world is), while this 
definition (predictability) is epistemic (about what one can know). 
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Definition 8: A phenomenon $i associated with preparation C\ is said to be 
irreducibly unpredictable, or to satisfy irreducible unpredictabil- 
ity (lU) ijf it has no predictable models and there is no phenomenon $2 
associated with a preparation C2 = Ci U c' (for all d ) which has a pre- 
dictable model and such that the frequencies fi of $1 are given by 

fM, B\a, b, ci) = J2 Pic'\ci)PiA, B\a, b, c,, c'), (2.12) 

c' 

where P{A, B\a, b, ci, c') G {0, 1}. 



In other words, a phenomenon is irreducibly unpredictable iff it has no predictable 
model and cannot be rendered predictable by knowledge of further variables. The 
reason for f l2.12p is that the only way in which an unpredictable phenomenon could 
be rendered predictable without fundamentally changing the phenomenon would be if 
there were a deterministic hidden variable model which predicted the phenomenon, but 
for which the "hidden variables" could be in principle known in advance. But if the 
hidden variables were known in advance they could (in fact they should, since the only 
distinction between A and c, as mentioned before, is that the latter are known) be 
incorporated into the preparation variables. That is why I use the notation c' for those 
further knowable variables. 

Definition 9: A model is said to satisfy locality (L) ijf 

P{A\a,b,c,\) = P{A\a,c,X), (2.13) 
plus the corresponding equation for B. 

This was pre cisely the meaning that Bell intended for this term in his original 1964 
lOugh without formal definition. Shimony called this parameter inde- 



paper 



Bemi- alt 



pendence ( \CMH^ . pg. 25). 



Definition 10: A model is said to satisfy outcome independence (01) 

P{A\a, b, B, c. A) = P{A\a, b, c. A), (2.14) 
plus the corresponding equation for B. 



This concept was introduced by Jarrett |Jar84l |. under t he mis leading name of com- 
pleteness. The present terminology is due to Shimony ( |CM89l |. pg. 25). One could 
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also consider calling it causality, so that local causality would be the conjunction of 
locality and causality. This choice would be justified if one takes causality to be a 
weakened form of determinism, in which the outcomes depend (maybe stochastically) 
only on the experimental settings, but not on the distant outcomes. In other words, in 
this view of causality the effects (the outcomes, things which are not controllable) at 
Alice's can depend only on the local or distant causes (the settings, things which are 
controllable) but not on the distant effects. This terminology, however, would seem to 
rule out a common "explanation" of the quantum correlations: that the measurement on 
Bob's system causes the quantum state to collapse which causes the outcomes at Alice 
to be what they are. This (non-relativistically invariant) causal picture would violate 
outcome independence since the collapsed quantum state depends on Bob's outcome. 

Definition 11: A model is said to he locally deterministic, or to satisfy 
local determinism (LD) iff it satisfies locality and determinism. 

Definition 12: A model is said to violate p iff it lacks that property. 

Definition 13: An operational theory is said to violate p iff all trivial 
models violate p. 

Definition 14: A phenomenon is said to violate p iff all models violate 
P- 

Definition 15: Nature is said to violate p iff a phenomenon violating p 
is observed. 



2.4 General results 

We now present the general results and relations (that is, those which are not specific 
to quantum mechanics) concerning the definitions of the previous section. 

By the definition of conditional probability P{A, B\a, b, c. A) = P{A\B, a, b, c, \)P{B\a, b, ( 
Using Corollary 1 we arrive at 

Theorem 1: Local causality is equivalent to factorisability, i.e., 

P{A, B\a, b, c. A) = P{A\a, c, X)P{B\b, c. A). (2.15) 



Jarrett 



Jar84l | showed that: 



Theorem 2 (Jarrett 1984a): Local causality is the conjunction of locality 
and outcome independence. 
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The purpose of that decomposition was to argue that outcome independence was the 
concept to be blamed, while locality was the real consequence of relativity and therefore 
ought to be maintained. We'll return to this important point later. 

Theorem 3: Local causality is strictly stronger than locality. 

That is, every locally causal model satisfies locality but not vice-versa. The proof 
is simple. Theorem 1 implies the first statement, and orthodox quantum mechanics 
provides an example of a model which satisfies locality but violates local causality. 

Theorem 4: Determinism is strictly stronger than outcome independence. 

If determinism holds, the outcome A is fully determined by (a, 6, c, A). Therefore knowl- 
edge of B cannot change the probability of A if (a, 6, c, A) are specified, which is just the 
statement of outcome independence. To see the failure of the converse, just note that 
orthodox quantum mechanics for separable states violates determinism but satisfies 
outcome independence. 

Corollary 2: Local determinism is strictly stronger than local causality. 

This follows from the definition of local determinism and Theorems 2 and 4. 

Theorem 5: Predictability is strictly stronger than determinism. 

Every predictable model is obviously deterministic, but some deterministic models are 
not predictable. A trivial example is that of a deterministic model in which one does 
not know all relevant variables A (or does not know them precisely enough) but could 
know them in principle, (e.g. classical statistical mechanics), but there are models in 
which one cannot know all A even in principle (e.g. Bohmian mechanics). 

Theorem 6: Some phenomena violate predictability. 

Since the distinction between the hidden variables (A) and the variables that specify 
the preparation (c) is an epistemic one, and since a phenomenon is defined partly by 
c, there are trivial examples of unpredictable phenomena — one just needs to ignore 
some relevant variables. 

This does not imply that there are irreducibly unpredictable phenomena. I'll return to 
this question later. 
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Theorem 7: No phenomenon violates determinism. 

That doesn't mean that no model violates determinism, but that every phenomenon 
has a possible model which satisfies determinism. To see that, define Aq and Ai by 
P(y4, _B|a, 6, c, Ao) = and P(A, _B|a, 6, c, Ai) = 1. Substituting in (12.51) we obtain 
/(A, S|a, 6, c) = P(Ai|c), resulting in that every possible frequency can be modelled 
in this form. Of course, that leaves open the question of whether there really exist 
such events corresponding to the variables to play the role of Aq and Ai, so one could 
read Theorem 7 as saying that it is impossible to prove that any phenomenon violates 
determinism. 

Corollary 3: No phenomenon violates outcome independence. 

Theorem 8 (Fine 1982): A phenomenon violates local determinism iff it 
violates local causality. 

That is, any phenomenon that has a locally deterministic mo del has a locally causal 



model and vice versa. This theorem is due originally to Fine |Fin82| . Note that this 
does not mean that all locally causal models are locally deterministic and vice versa 
(which in fact is false as per Corollary 2). 

Proof. Any phenomenon that has a locally deterministic model automatically has a 
locally causal model, since all LD models are LC. To see the converse, remember that 
if there exists a locally causal model, the frequencies can be written as 

/(A, S|a, h,c) = Y, PLc{Mc)PLc{A\a, c, \)Plc{B% c, A). (2.16) 

A 

We now add extra hidden variables for each of the factors on the right hand side, in a 
similar fashion as for the proof of Theorem 7. We decompose 

PLc{A\a, c, A) = ^ P'(Aa|c, \)PLD{A\a, c, A^), (2.17) 

Aa 

where PLD{,A\a, c, Aa) = Aa G {0, 1}. With a similar decomposition for B, we obtain 



f{A,B\a,b,c) = J2PLc{Mc)PLc{A\a,c,\)PLc{B\b,c,\). 

A 

= Yl PLc{Mc)P'{XA,\B\c,\)PLD{A\a,c,\A)PLDiB\b,c,Xi^.l8) 

X,Xa,Xb 

= J2 PLD{X'\c)PLD{A\a, C, X')PLD{B\b, c, A'), 
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where we define A' = (A,A^, A^) and Pld{^'\c) = PLc('^|c)-P'(Ayi, A^jc, A), obtaining a 
locally deterministic model as desired. 



2.4.1 Local causality and signalling 

Bell was adamant in stressing that his concept of local causality was quite distinct from 
the concept of no faster than light signalling. One of the reasons for Bell's rejection of 
the importance of the concept of signalling was that he understood that it was hard 
to talk about signalling without using anthropocentric terms like 'information' and 
'controllability': 



"Suppose we are finally obliged to accept the existence of these correlations 
at long range, (...). Can we then signal faster than light? To answer this we 
need at least a schematic theory of what we can do, a fragment of a theory 
of human beings. Suppose we can control variables like a and b above, 
but not those like A and B. I do not quite know what 'like' means here, 
but suppose the beables somehow fall into two classes, 'controllables' and 
'uncontrollables' . The latter are no use for sending signals, but can be used 



for reception." (|Bel87|, pg.60) 

But he rejects the idea that signal locality be taken as the fundamental 
limitation imposed by relativity: 

"Do we have to fall back on 'no signalling faster than light' as the expression 
of the fundamental causal structure of contemporary theoretical physics? 
That is hard for me to accept. For one thing we have lost the idea that 
correlations can be explained, or at least this idea awaits reformulation. 
More importantly, the 'no signalling...' notion rests on concepts which are 
desperately vague, or vaguely applicable. The assertion that 'we cannot 
signal faster than light' immediately provokes the question: 

Who do we think we are? 

We who can make 'measurements', we who can manipulate 'external fields', 
we who can signal at all, even if not faster than light? Do we include 
chemists, or only physicists, pl ants, o r only animals, pocket calculators, or 



only mainframe computers?" (|Bel87|, pg.245) 



That is one of the reasons I have first presented the definition of local causality in 
a general context without the operational definitions used from Section 12.31 onwards. 
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In Definition 1 no mention was made of measurements and outcomes, but only of 
events and space-time structure. We have then translated the consequences of that 
into our operational definitions with Corollary 1. But to even start talking about 
signalling, we need to have the operational model set up and explicit mention what 
are the controllable, uncontrollable and observable variables within it. This is precisely 
why Bell rejects the idea that this is a fundamental notion. I agree with Bell that if 
one wants to entertain the idea that signal locality is the fundamental restriction from 
relativity, one needs to put fundamental weight on epistemological terms. However, as 
long as one clearly understands that, I have no qualms with that position, in fact I 
find it an interesting possibility to pursue, and in this chapter it will lead to a nice new 
result. 

Given the assumptions that the variables a and b are controllable (an assumption, in 
fact, already made in Axiom 3) we can formulate the concept of signal locality as follows. 

Definition 16: A phenomenon is said to satisfy signal locality (SL) iff 

f{A\a,b,c) = f{A\a,c), (2.19) 

plus the corresponding equation for B. 



The reason is straightforward. If the phenomenon violates signal locality, then there 
exist at least two possible choices of setting b, b' such that f{A\a,b,c) ^ /(y4|a, 6', c). 
Therefore by looking at the frequency of outcomes of A in a large enough ensemble 
(and in principle it is always possible for Alice to make all of the measurements in 
her ensemble space-like separated from all measurements in Bob's ensemble), Alice can 
determine with arbitrary accuracy what setting Bob has chosen. Conditionalising this 
choice on a source of information. Bob can thereby send signals to Alice. 

Definition 17: A model is said to satisfy signal locality (SL) iff the 
phenomena it predicts satisfies SL. 

The reason behind Jarrett's preference for locality (mentioned below Theorem 2) over 
outcome independence is that Jarrett believed that locality was equivalent to signa l 
locality. However, that is an unwarranted assumption as argued by Maudlin |Mau94 |. 
The main counter-example is Bohmian mechanics. That theory violates L but not SL. 
The reason is that any attempt of controlling the distant outcome by the local choice 
of setting is thwarted by an unavoidable lack of knowledge of the hidden variables. On 
the other hand, if a model satisfies L it necessarily satisfies SL, as can be easily seen by 
substituting Eq. (12.131) in (12. 5p and summing over B. In other words. 



24 



Concepts of Experimental Metaphysics 



Theorem 9: : Locality is strictly stronger than signal locality. 

While Jarrett's rejection of models which violate locality is not well-grounded, his re- 
pulse of violations of locality is. 

Theorem 10: A phenomenon violates locality iff it violates signal locality. 

As argued above, if a phenomenon has a model which satisfies locality then the phe- 
nomena satisfies signal locality. The converse is also true: if a phenomenon satisfies 
signal locality then there is a model which satisfies locality: one example is the trivial 
model that corresponds to that phenomenon. 

Theorem 11: // relativistic invariance is assumed, a phenomenon which 
violates signal locality will lead to contradictions. 

Faster than light signalling can lead to paradoxes like the famous "grandfather para- 
dox" of time travel — a time traveller goes to the past and kills his grandfather before 
his father was born, paradoxically thwarting the possibility of his own existence. Con- 
sider a scenario in which signal locality is violated by Alice's and Bob's experimental 
apparatuses such that /(Aijai, 6i, Ci) ^ f{Ai\ai,h'i,ci). How exactly these frequencies 
differ is unimportant, as Alice can always use an arbitrarily large ensemble such that 
f{A'^\bi) ~ 1 and /(A'Jfo'^) ^ 0, where A'^ is the event which corresponds to the fre- 
quency of the outcome Ai within the ensemble being approximately f{Ai\ai^ ci) and 
with good confidence distinct from /(/li|ai, 6'^, Ci). Since A'^ and the choice between 
hi and h'^ are space-like separated, there exists a reference frame in which they occur 
simultaneously. If relativistic invariance is assumed, then it must be possible to pro- 
duce, in any reference frame, a similar instantaneous signalling setup (otherwise there 
would be a preferred reference frame), and if we assume that there's no preferred spatial 
direction for such signal transmission (if there were a preferred spatial direction, that 
would also be a violation of the principle of relativity), we can setup another pair of 
boxes where /(i?2|a2, ^2, C2) 7^ /(i?2|a2, &2, C2). The choice 02/02 of Alice's experimental 
setting would be conditioned on A'^ (and therefore would be in the future light cone 
of A'^) and by use of a similar procedure as for the first setup, Alice and Bob arrange 
things such that /(i?2|^'i) ~ 1, /(-B2l~'^i) ~ 0? where '-1' denotes logical negation. 
Bob's outcome B'2 is arranged to be in the past light cone of the choice between hi and 
h\ , and Bob conditions that choice such that he will choose h'^ if the result of measure- 
ment 2 is i?2, otherwise he will choose hi. So we finally obtain the contradictory set of 
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implications 

61 ^ a; ^ ^2 ^ 6'i ^ 

6; ^ ^a; ^ ^ 61 ^ ^6;. (2.20) 

That is, if Bob chooses bi, then he does not choose 61, If Bob chooses b[, then he 
does not choose b[. So it is fair to say that relativity seems to exclude the possibility 
of violation of signal localitj^. Bell would say that this is not all it excludes, that 
relativity implies not simply signal locality but local causality. An interesting question 
is therefore whether violation of local causality by a phenomenon which does not violate 
signal locality can lead to such time travel paradoxes. If signal locality is satisfied, the 
above scenario cannot be set up, and the fact that we have observed violations of local 
causality (up to some loopholes) seems to point to the fact that no contradictions can 
arise out of that violation (inasmuch as a contradiction cannot be actually observed). 
However sensible this statement may sound, I can't provide a proof to it. But it is 
interesting to mention it as a conjecture. 

Conjecture 1: A phenomenon that violates local causality but not signal 
locality does not lead to contradictions even if relativistic invariance is 
assumed. 

Remark 1: Nevertheless, a phenomenon that violates local causality is a 
non-local resource. That is, there are tasks Alice and Bob can do 
using this phenomenon that they could not do if they had access only to 
locally causal phenomena. 

Theorem 12: For a phenomenon to violate locality it is necessary and 
sufficient for the operational theory to violate locality. 

This follows directly from Theorem 10 and the definitions of operational theory and 
signal locality. 

Theorem 13: For a phenomenon to violate local causality it is necessary 
but not sufficient for the operational theory to violate local causality. 

The necessary part is obvious. We can show insufficiency by appealing to a counter- 
example: single particle quantum mechanics restricted to measurements of position. 



^Although there are several attempts to make sense of this kind of paradox in the philosophical 
literature about time travel. However, most attempts either violate Axiom 2, by postulating multiple 
universes, or Axiom 3, by restricting the possibility of conditionalising the choices of experiment on 
any variables we choose, thereby blocking the setup of the paradox from the start. If one wants to keep 
those axioms, I am not aware of any clean way around the paradox except enforcing signal locality. 
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The operational theory violates local causality (see Theorem below) but there's a locally 
causal model for the phenomena involved: single-particle Bohmian mechanics. 

Remark 2: Thus, unlike locality, to say whether a phenomenon violates 
local causality it is necessary to consider hidden variable models. 



2.5 Quantum Results 

Definition 18: Orthodox quantum theory or operational quantum 
theory (OQT) is an operational theory in which 

P{A, B\a, b, c, A) = Tr[UA ® UbPc], (2.21) 

where IIa is a projector onto the subspace corresponding to outcome A of 
observable a and similarly for B , while pc is a positive unit-trace operator 
associated with the preparation c. 

Definition 19: A quantum model is a model whose predictions agree 
with OQT. 

Definition 20: A quantum phenomenon is a phenomenon predicted by 
OQT. 

Theorem 14 (Einstein 1927): OQT violates local causality. 



This was first proved by Einstein at the 1927 Solvay conference [Wic95|. Consider a 
single particle with a position wavefunction spread out over space, and consider a as a 
measurement of the particle position in the region around Alice and similarly for Bob. 
Then the probability that Alice finds the particle in her region is not independent of 
whether Bob finds the particle in his region. And since in OQT there are no hidden 
variables, (12.71) is not satisfied. 

Theorem 15 (Heisenberg 1930): OQT does not violate locality. 

This is a well-known result that arises out of the fact that operators corresponding 



to measurements in spatially separated regions commute. In his 1930 book |Hei30l |. 
Heisenberg replied to Einstein's objection to OQT above by pointing out that the 
theory would not allow faster-than-light signalling. 

Corollary 4: Quantum phenomena do not violate signal locality. 
Theorem 16 (Jarrett 1984b): OQT violates outcome independence. 



This is a consequence of Jarrett's Theorem 2, and Theorems 14 and 15. 
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2.5.1 The EPR argument 

As we mentioned above, in 1927 Einstein had already shown that OQT violated local 
causality (the term wasn't used until Bell, but Ei nstein clearly stated that quantum 



mechanics "contradicts the principles of relativity" |Wis06l |). That was always intended 
by Einstein as a proof that OQT was incomplete, i.e., that there must be further hidden 
variables beyond the quantum state to completely specify the properties of physical 
systems. 

Einstein's 1927 argument however is not as widely recognised as the Einstein, Podol- 



sky and Rosen (EPR) argument of 1935 |EPR35l |. Maybe because the argument in 



that paper was more well articulated, maybe because in the Solvay conference Einstein 
also tried to prove quantum mechanics to be inconsistent — not only incomplete — 
by carefully set-up thought experiments. Those attempts were thwarted by Bohr, who 
argued that Einstein was not consistently using the uncertainty principle at all levels in 
his analysis of the experiments. The physics community largely takes Bohr to have tri- 
umphed over Einstein on those arguments, and therefore Einstein's failure on that front 
would have been automatically transferred to his 1927 theorem about local causality. 



In a recent paper |HS07l |. Harrigan and Spekkens suggest that Einstein himself preferred 
the (more complicated) argument for incompleteness using entangled states. More 
precisely, Einstein's preferred argument was that used in a 1935 correspondence with 
Schrodinger, not the EPR argument which — those authors claim — does not reflect 
precisely Einstein's opinion on the matter. The reason for that preference, they argue, 
is that this argument not only rules out the completeness of quantum mechanics (if one 
assumes local causality, of course), but also it provides an extra argument for the view 
that the quantum states are merely epistemic in nature. 



Another reason for preference of the EPR argument, mentioned in |Wis06l | and |HS07l | 
over Einstein's 1935 is an experimental one: the measurement statistics used in the 
1927 argument can be trivially simulated with a mixed state, while those of the EPR 
argument cannot. The critics could evade the earlier argument by denying the coherence 
of the state upon which the measurements are performed, a move that cannot be made 
in the later. 

But the most likely reason, in my opinion, for the community's recognition of the EPR 
argument over the 1927 one is that the former also substantially differs from the later in 
that it introduces entangled states. Even if EPR failed in convincing the community of 
the incompleteness of orthodox quantum mechanics, the kind of states they considered 
opened the door to Bell's recognition of the failure of one of EPR's premises instead 
— local causality — and from there lead to the multiple applications in the modern 
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field of quantum informa tion sci ence. They also influenced Schrodinger to envisage his 
infamous "cat paradox" |Sch35| (and to coin the term 'entanglement' to refer to the 
strange kind of states EPR consider). 

EPR's argument is essentially that given (i) a suitable necessary condition for complete- 
ness of a theory; (ii) an apparently reasonable sufficient condition for determining when 
a physical variable corresponds to an "element of physical reality"; (iii) the assumption 
of local causality; and (iv) some predictions of quantum mechanics concerning entan- 
gled states; one must conclude that quantum mechanics is incomplete, in the sense that 
there must exist hidden variables to further specify physical states. I won't go into the 
EPR argument in detail here, however, as it will be the subject of Chapter [3l 



2.5.2 Bell's Theorems 

We now arrive at the famous Bell theorems. By 1964 it was known that OQT violates 
local causality, but one could still imagine, as EPR did, that a more detailed description 
of the phenomena was possible in which local causality was maintained. Bell's startling 
contribution was to show that this hope was futile. The importance of this theorem 
cannot b e overe mphasised. It has even been dubbed the "most profound discovery of 



science" 



Sta77|. In 1964, however. Bell did not prove the stronger theorem about local 



causality, but only a weaker version: 

Theorem 17 (Bell 1964): All deterministic quantum models violate lo- 
cality. 

In other words, quantum phenomena violate local determinism. By Fine's Theorem ^ 
this is equivalent to the strong form of Bell's theorem: 

Theorem 18 (Bell 1971): Quantum phenomena violate local causality. 

However, that was not fully recognised until much later. In 1964 Bell was considering 
locality and not local causality, as is evident by this passage: 

"It is the requirement of locality, or more precisely that the result of a 
measurement on one system be unaffected by operations on a distant system 
with wh ich it has interacted in the past, that creates the essential difficulty." 



([BeieJl, pg. 14) 



and that he considers only deterministic hidden variables is clear in the following dis- 
cussion of the specific setup: 
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"The result A {...) is then determined by a and A, and the result B (...) 
is determined by b and A, and A{a, X) = ±1, B{h,\) = ±1. The vital 
assumption is that the result B (...) does not depend on the setting a, (...) 
noiA on 6." ( |Bel64| - Pg- 15) 



In a 1971 paper (|Bel87|, pg. 37), Bell considered a model which allowed some "inde- 
terminism with a certain local character" associated with the detectors — essentially a 
factorisable model with arbitrary probabilities, which by Theorem 1 follows from local 
causality — so this could be taken as the first clear proof of Theorem 18. But he did 
n ot clea rly use the term local causality with the meaning of Definition 1 until 1976 
([BelSTfl, pg. 54). 

While in hindsight Bell's 1964 and 1971 theorems are logically equivalent (by way of 
Theorem 8), the 1964 theorem could not be directly applied to experimental situations, 
since it assumed perfect correlations (which are obviously not observable in any real 
experiment). The Bell inequality of 1971 however ( essentially a version of the Clauser, 
Horne, Shimony, Holt (CHSH) inequality of 1969 [CHSH69|, but which was derived 
from a local deterministic model) is applicable to real experimental situations. Many 
experiments realised since then strongly follow the quantum mechanical predictions, 
and (up to some loopholes involving detection efficiencies and/or lack of space-like 
separation) support the conclusion 



Conclusion 1: Nature violates local causality. 



2.5.3 Determinism and Predictability 

Theorem 19 (Born): OQT violates determinism. 

This is essentially the content of Born's postulate that the modulus square of the 
wave function corresponds to probabilities of outcomes of measurements, the extension 
of which for mixed states is given by (12.2ip . Since for any state (determined by a 
preparation c) there is at least one measurement for which the probabilities are different 
from 1 or 0, determinism is violated by OQT. 

Corollary 4: OQT violates predictability. 

Corollary 5: Quantum phenomena violate predictability. 

That is a consequence of Corollary 4 plus the fact that the definition of predictabil- 
ity implies that if a model is not predictable, then the phenomena it predicts violates 
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predictability. This still does not mean that quantum phenomena are irreducibly un- 
predictable, that is, it doesn't mean that there cannot be further knowable variables 
such that the phenomena would be rendered predictable. That there are no such fur- 
ther knowable variables is essentially the content of Heisenberg's Uncertainty Principle 
(HUP), that is, 

Theorem 20 (Heinsenberg 1926): Quantum phenomena are irreducibly 
unpredictable. 

But the proof of Heisenberg's uncertainty principle, based on the commutation relations 
between different observables, can only be made within quantum mechanics. Bohmian 
mechanics is deterministic but also not predictable, but again those facts are model- 
dependent. The definition of quantum phenomenon is "a phenomenon predicted by 
OQT". OQT states that some phenomena are irreducibly unpredictable (by way of 
HUP), but OQT could be wrong about that. However, Theorem 7 (which states that 
no phenomenon violates determinism) may be taken to imply that nothing can be said 
about predictability in a model-independent way. 

This debate was in the centre of the famous Einstein-Bohr debates starting in the 
Solvay conference of 1927. Einstein was then unsatisfied with the indeterminism of 
quantum mechanics and attempted to show that it was inconsistent by devising clever 
gedanken experiments aiming to break Heisenberg's Uncertainty Principle. But Bohr 
thwarted every such attempt by pointing a fiaw in Einstein's reasoning. History is 
one the side of Bohr who is widely regarded as the victor of those debates. Einstein's 
flaw was essentially to ignore the uncertainty principle for some variables, which would 
allow him to obtain more knowledge than permitted by OQT for some other variables. 
Bohr's reply was to point that if the principle was observed consistently throughout 
the problem, Einstein's move would fail. However, of course, Bohr could never prove 
that OQT was correct and that the HUP must always hold, all he did was point out 
that OQT was consistent. There is no question about that, but could he give a better 
argument to convince Einstein that OQT was correct about that? On this question I 
offer the following theorem. 

Theorem 21: // relativistic invariance is assumed, Nature is irreducibly 
unpredictable or some observed phenomena can lead to contradictions. 

The contradictions in question are just those involved in the grandfather-type paradox 
considered in Theorem 11. The proof is simple. Suppose Nature is not irreducibly 
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unpredictable, i.e., that for any observed phenomenon there is a possible trivial deter- 
ministic model that reproduces the phenomenon as in (|2.12p . By Theorem 17 there are 
phenomena for which there is no local deterministic model, therefore any such deter- 
ministic model must be nonlocal. However, since the models under consideration are 
trivial, if they are nonlocal the phenomena they predict must violate signal locality. 
Therefore if those phenomena are not irreducibly unpredictable then they violate sig- 
nal locality and by Theorem 11 will lead to contradictions if relativistic invariance is 
assumed. Moreover, such phenomena violating local determinism have been observed, 
therefore if Nature is not irreducibly unpredictable then some observed phenomena can 
lead to contradictions (if one obtains the hitherto hidden but observable information to 
render the phenomena predictable). 

In the last of Einstein's thought experiments, Bohr's reply made use of Einstein's own 
General Theory of Relativity to reject Einstein's thought experiment. Of course, Ein- 
stein or Bohr were not aware of Bell's Theorem, but interestingly, if they were, Bohr 
could, by the use of Einstein's Special Theory, have convinced Einstein not only that 
OQT was consistent, but that Nature is irreducibly unpredictable regardless of quantum 
mechanics. 

Conclusion 2: The above definitions and theorems imply the following 
structure in phenomenon space (PS): 



vSL = vLc vLD = vLC C PS, 
{} = vD C vP C PS (2.22) 

Here {} denotes the empty set, vSL denotes the set of phenomena vio- 
lating signal locality, and similarly for vL, vLC, vLD, vD and vP. 



2.6 Making sense of it 

In the beginning of this Chapter I indicated that there are, still today, many discussions 
about what Bell's theorem(s) prove and what concepts it requires us to give up. Here 
I'll sketch a few of those debates and what we can conclude about them in light of the 
careful definitions and theorems of this chapter. 
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2.6.1 Locality and outcome independence 

I have already mentioned Jarrett's preference of rejecting outcome independence and 
keeping locality. The motivation is essentially Theorems 9 and 10, which state that 
violation of locality by a phenomenon will lead to paradoxes if relativistic invariance 
is assumed. However, before rushing into conclusions, we must remember that the 
concept of a phenomenon violating a property is quite distinct from that of a model 
lacking that property. In fact, we have seen that it is possible for a model to violate 
locality while the related phenomena do not. So keeping locality does not necessarily 
lead to paradoxes. A desire to avoid paradoxical situations is not sufficient reason to 
reject locality. 

Bell's theorem states that quantum phenomena violate local causality. Heisenberg's 
Theorem 15 says that quantum phenomena do not violate locality. One could take that 
to mean that quantum phenomena violate outcome independence. But by Corollary 
3, it is impossible for any phenomenon to violate outcome independence. The choice 
will depend on the model. Some models respect locality but not 01, such as OQT. 
Some respect 01 but not locality, such as Bohmian mechanics. Some respect neither, 
as Nelson's mechanics. Bell's theorem says that none can respect both. 

2.6.2 Determinism and hidden variables 

There's a common misconception in the literature about Bell inequalities which main- 
tains that what Bell's theorem tells us to give up is determinism and/or hidden vari- 
ables. That probably arises from the fact that the original 1964 Bell theorem (and 
many subsequent derivations) was in fact about the failure of local determinism, that 
is, the conjunction of locality and determinism. For a reader who was already used to 
the violation of determinism by orthodox quantum mechanics, and who didn't notice 
the fact that violation of locality by a model did not imply (the truly objectionable) 
violation of signal locality, the choice seemed clear. And since the early models did 
not mention the possibility of nondeterministic hidden variables, it is understandable if 
those readers then took Bell's theorem as definitive proof of the failure of the project 
of hidden variables. 

Curiously, that was quite the opposite from Bell's intent. Bell was a supporter of the 
hidden variable program, and his purpose was to show that it was not fair to reject 
Bohmian mechanics — the leading contender among the hidden variable theories — 
due to its nonlocality, since that was an unavoidable feature of any hidden variable 
model. 
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In any case, we now know that what is at stake is not just local determinism, but the 
weaker concept of local causality. And by Remark 2, to even consider the concept of 
a phenomenon violating local causality, one needs to consider hidden variable theories. 
And even if one has other reasons to reject hidden variables, one cannot avoid the fact 
that OQT violates local causality. In other words, rejecting determinism and/or hidden 
variables does not make the world an entirely local place. 

A common reason for rejecting hidden variables, Bohmian mechanics in particular, is 
that this theory is deterministic but does not allow predictability. If the hidden variables 
exist, why can't we know about them? Theorem 21, while not directly answering the 
question, points out to the fact that (unless relativistic invariance is violated) Nature is 
irreducibly unpredictable, so that this problem is not exclusive of Bohmian mechanics 
(just as nonlocality, Bell showed, isn't either). 

With no intention of being a supporter of hidden variables, it's interesting to point 
out that there are other reasons to consider them: they are potential solutions to the 
measurement problem, in that they do not make 'measurement' a fundamental feature 
of the world like OQT, they make sense of quantum cosmology, and they allow an 
ignorance interpretation of probabilities. 

2.6.3 Reality 

Finally, some theorists defend the idea that what violations of Bell's inequalities tell 
us to give up is realism. These people take the term 'local realism' to be a conjuntion 
of 'locality' and 'realism', and favour to reject the latter. We can identify four main 
views in which such a conjunction, in our terminology, can consistently be taken. In 
the first. Axioms 1 to 3 are accepted implicitly and realism is equated with hidden 
variables. This view, therefore, falls under the same criticism pointed out in 12.6.21 
In the second. Axioms 1 to 3 and the possibility of HVs are accepted implicitly, but 
one equates 'realism' with outcome independence or determinism. This would make 
sense if one believes that the only acceptable hidden variable models are those that are 
deterministic, or at least that satisfy outcome independence. Then one can consistently 
reject 'realism' in this sense and keep locality. However, the holder of this view cannot 
escape the fact that local causality is still violated by OQT. In a third possibility. Axioms 
1 to 3 are accepted implicilty, all the concepts defined through use of an ontological 
model are put under the label 'realism', and 'locality' is taken to mean signal locality. 
In this view, therefore, 'locality' is not violated, and one refuses to talk about anything 
that goes under the name 'realism'. This, however, cannot be more than a form of 
operationalism, for which the only escape from the fact that OQT violates local causality 
would be to hold that local causality is not an interesting concept. 
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A fourth possibility is to reject Axioms 1 or 2. However, the motivation for that cannot 
be simply the desire to keep local causality, because the MRRF had to be assumed 
before one can even define local causality. So a theorist who chooses to reject realism 
probably has other motivations than simply to save local causality. 

That said, there are possible moves in that direction. A popular one is Everett's (or 
Many- Worlds) interpretation |Eve57l |. which maintains that there is in reality a single, 
unitarily-evolving, wave function for the universe (or multiverse?). Therefore in a sense 
all possible outcomes of all experiments (and of all particle interactions throughout the 
universe for that matter) occur. Axiom 2 is clearly violated. Axiom 1 is not neces- 
sarily violated, since events really occur somewhere in the multiverse independently of 
observers. It's just that we don't have direct access to all of them. 

ational view of quantum states, along the 



Rov96l |. In this view, quantum states are 



Yet another possibility is to maintain a re 
lines supported for example by Rovelli in 
always relative to an observer or reference frame. It is possible that different observers 
will disagree about whether or not some events occur, and in a relational world there's 
no matter of fact about such events independently of any observer. Axiom 1 is clearly 
violated. Axiom 2 may or may not be, depending on how one pursues the idea. It is 
possible that relational quantum theories of gravity (i.e., of space-time geometry) will 
also reject it. To emphasise the meaning of a violation of Axiom 1, let me be explicit: in 
a fully relational theory, it is possible that the very existence of events are not absolute 
facts independent of reference frames. This would be a startling conclusion, far beyond 
the mere relativity of space and time which we have come to accept and — dare I say 

— understand. Can reality itself be relative? Although this idea goes dangerously close 
to a radical solipsism, a more moderate reading may be possible. In special relativity, 
the fact that lengths or time lapses are relative does not entail that they don't exist 

— they just have this previously unsuspected property. In the same vein, to say that 
events are relative to reference frames does not entail that they don't exist — again, it 
is just another surprising property. However, I would say that although it is intriguing, 
whether the whole idea ultimately makes sense (and most importantly, whether it can 
lead to new predictions) remains to be seen. 



Chapter 3 

The EPR paradox and Steering 



As outli ned in Section [2], in a seminal 1935 paper, Einstein, Podolsky and Rosen (EPR) 



EPR35l | demonstrated an inconsistency between the premises that go under the name of 



local realism and the notion that quantum mechanics is complete. EPR never regarded 
it as a paradox, but as an argument to prove the incompleteness of quantum mechanics. 
The name 'paradox' was probably introduced by those who could not believe with EPR 
that quantum mechanics was indeed incomplete but could not see a flaw in the argument 
either. In hindsight, we now know (since Bell) that, while the argument is sound, one 
of the premises — local causality — is false. However, we will retain the historically 
prevalent term 'paradox^. Our reasons to study the EPR paradox are threefold. 

Firstly, we aim to do historical justice to EPR and put their argument in their correct 
standing, distilling its essence and formalising it to make it clear how it relates to the 
notion of local causality as used in discussions of Bell's theorem and to the notion of 
entanglement or quantum non-separability. 

Secondly, we will see that the EPR paradox can be demonstrated in a loophole-free 
way with current technology, thereby providing conclusive evidence of the failure of 
its underlying premises, as opposed to the current situation with Bell inequalities, as 
pointed out in Chapter [T] on page [H 

Thirdly, we will relate the EPR paradox to the concept of steering originally defined 
by Schrodinger Sch35 | in a reply to EPr|^ and recently formalised by Wiseman and 

^The American Heritage Dictionary defines 'paradox' as 1. A seemingly contradictory statement 
that may nonetheless be true. 2. One exhibiting inexplicable or contradictory aspects. 3. An assertion 
that is essentially self-contradictory, though based on a valid deduction from acceptable premises. 4. 
A statement contrary to received opinion. Our usage of the term is therefore in accordance with 
definition 3 if one takes the premises of the EPR argument to be acceptable (in which case we arrive 
at a contradiction) and with definition 4 if, instead, one takes the argument to imply the failure of one 
of those a priori reasonable premises. 

^In this same seminal article he coined the term entanglement and introduced his infamous cat 
paradox. 
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The EPR paradox and Steering 



co-authors 



WJD07|, confirming the latter's claim§ that any demonstration of the EPR 



paradox is also a demonstration of steering. We will further show that the converse 
is also true: any demonstration of steering is also a demonstration of the EPR para- 
dox. With reference to that work we will then see that the EPR paradox (or steering) 
constitutes a new class of (non-) locality intermediate between the classes of quantum 
(non-)separability and Bell (non-)locality. This classification could prove important in 
the context of Quantum Information Processing, therefore it is desirable to formulate 
criteria to determine to which classes a given state (or a set of correlations) belongs. 

The ori ginal EPR paradox was based on position and momentum observables. Bohm 



Boh5l| extended the example of EPR to the case of discrete observables, particularly 



to the case of two spin-1/2 particles. That is the version that was used by Bell in 
deriving his famous inequalities and it has played a central role in our understanding 
of quantum entanglement. Both the original argument of EPR and Bohm's version, 
however, rely on perfect correlations, which are obviou slv exp erimentallv unattainable. 
That situation was corrected by Margaret Reid in 1989 [Rei89|, when she derived exper- 
imental criteria for demonstration of the EPR paradox which were applicable to realistic 
situations where noise and losses are inevitable. The Reid criteria are a standard tool 
in Quantum Optics, and have been used for demonstrations of the EP R paradox with 
continuous- variables 
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BSLR03, 



HBBB04l | where quadra- 



ture measurements play the role of the position and momentum observables. 

However, there is to d ate no such tool for the case of discrete observables. Experiments 
by Wu and Shaknov |WS50l | gave evidence for discrete EPR correlation, but because 
detection efficiencies were extremely low, only a small fracti on of emitted pairs were 
detected, meaning that "no-enhancement" assumptions |CS78l | were incorporated. 

We will derive new criteria that can be applied to discrete observables, both for the 
case originally envisaged by Bohm and to other classes of states, even in presence of 
inefficiencies in the preparation or detection procedures. The extent to which these 
inefficiencies are allowed will be studied in detail. Using one of these criteria, we 
show that the loop-hole free demonstration of the EPR-Bohm paradox is predicted for 
considerably lower detection efficiencies than required for Bell's theorem. This disparity 
is even more striking in the case of macroscopic fields, where we propose for feasible 
efficiencies to demonstrate a type of EPR-Bohm correlation from which one can deduce 
existence of mesoscopic and macroscopic quantum superpositions. 



^While their claim is correct, their proof was incomplete. They consider a particular instance of 
the EPR- Reid criteria (a more general instance of which we will analyse in Section 13. 2p and show that 
for a certain class of states that criterion is violated if and only if the state is steerable. However, 
the EPR-Reid criteria are a subset of all possible criteria for the EPR paradox and their proof only 
considers a specific class of states. In this thesis we will see that the EPR paradox and steering are in 
fact quite generally equivalent. 
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While not enough to falsify EPR's local realism, the proposed EPR paradox experiments 
do demonstrate a particularly strong form of entanglement. For this EPR-entanglement, 
local realism can only be reconciled with quantum mechanics if one accepts the existence 
of an underlying localised hidden variable (non-quantum) state. Put another way, if 
one can accept only quantum states, then the EPR correlation implies nonlocal effects. 



3.1 The Einstein-Podolsky- Rosen argument 

We will start with a detailed analysis of the original EPR argument, before finding 
a suitable mathematical formulation of it. The EPR paper starts with a distinction 
between reality and the concepts of a theory, followed by a critique of the operationalist 
position, clearly aimed at the views advocated by Bohr, Heisenberg and the other 
proponents of the Copenhagen school. 



"Any serious consideration of a physical theory must take into account the 
distinction between the objective reality, which is independent of any theory, 
and the physical concepts with which the theory operates. These concepts 
are intended to correspond with the objective reality, and by means of these 
concepts we picture this reality to ourselves. 

In attempting to judge the success of a physical theory, we may ask ourselves 
two questions: (1) 'Is the theory correct?' and (2) 'Is the description given 
by the theory complete?' It is only in the case in which positive answers 
may be given to both of these questions, that the concepts of the theory 
may be said to be satisfactory. The correctness of the theory is judged by 
the degree of agreement between the conclusions of the theory and human 



experience." 



EPR35l | 



Any theory will have some concepts which will be used to aid in the description and 
prediction of the phenomena which are their subject matter. In quantum theory, 
Schrodinger introduced the concept of the wave function and Heisenberg described 
the same phenomena with the more abstract matrix mechanics. EPR argue that we 
must distinguish those concepts from the reality they attempt to describe. One can see 
the physical constructs of the theory as mere calculational tools if one wishes, but those 
authors warn that one must be careful to avoid falling back into a pure operationalist 
position; the theory must strive to furnish a complete picture of reality. 

In Chapter [2l we argued that a minimal realist-relativistic framework for physical the- 
ories, and certainly the position advocated by Einstein, can be represented by the 
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conjunction of Axioms 1 and 2 of that chapter, namely that the existence of physical 
events is independent of observers or reference frames and that those events can be as- 
sociated to points in a relativistic space-time. This framework makes explicit, as EPR 
desired, that events are among those things which are part of the "objective reality, 
which is independent of any theory". With that framework in place, we can abstract 
out any specific concepts of a theory and represent the most fundamental aspects of 
any description of a phenomenon by what was termed an 'ontological model' or simply 
'model'. EPR's requirement that the theory is correct is built into Definition 2 of a 
model by requiring that it correctly predicts the phenomenon. Their requirement that 
it be complete will be addressed within the ontological model shortly. But first let us 
look at the rest of EPR's argument. 

EPR follow the previous considerations with a necessary condition for completeness: 



EPR's necessary condition for completeness: "Whatever the meaning 
assigned to the term complete, the following requirement for a complete 
theory seems to be a necessary one: every ele ment of t he physical reality 
must have a counterpart in the physical theory. |EPR35| 



Soon afterwards they note that this condition only makes sense if one is able to decide 
what are the elements of the physical reality. Contrary to a common belief, they did 
not then attempt to define element of physical reality. Instead, they provide a sufficient 
condition of reality: 



EPR's sufficient condition for reality: "The elements of the physical 
reality cannot be determined by a priori philosophical considerations, but 
must be found by an appeal to results of experiments and measurements. 
A comprehensive definition of reality is, however, unnecessary for our pur- 
pose. We shall be satisfied with the following criterion, which we regard 
as reasonable. //, without in any way disturbing a system, we can predict 
with certainty (i.e., with probability equal to unity) the value of a physical 
quantity, then there exi sts an e lement of physical reality corresponding to 
this physical quantity." |EPR35 | 



Later in the same paragraph it is made explicit that this criterion is "regarded not 
as a necessary, but merely as a sufficient, condition of reality". This is followed by a 
discussion that, in quantum mechanics, if a system is in an eigenstate of an operator 
A with eigenvalue a, by this criterion, there must be an element of physical reality 
corresponding to the physical quantity A. "On the other hand", they continue, if the 



3.1 The Einstein-Podolsky- Rosen argument 



39 



state of the system is a superposition of eigenstates of A, "we can no longer speak of 
the physical quantity A having a particular value". After a few more considerations, 
they state that "the usual conclusion from this in quantum mechanics is that when the 
momentum of a particle is known, its coordinate has no physical reality" . We are left 
therefore, according to EPR, with two alternatives: 



EPR's dilemma: "From this follows that either (1) the quantum-mechanical 
description of reality given by the wave function is not complete or (2) when 
the operators corresponding to two physical quant ities do not commute the 
two quantities cannot have simultaneous reality." |EPR35| 



They justify this by reasoning that "if both of them had simultaneous reality — and 
thus definite values — these values would enter into the complete description, according 
to the condition for completeness". And in the crucial step of the reasoning: "If then 
the wave function provided such a complete description of reality it would contain these 



lis not being the case, we are 
BMnd | have pointed out that 



values; these would then be predictable [my emphasis]. T 
left with the alternatives stated". Brassard and Methot 
strictly speaking EPR should conclude that (1) or (2), instead of either (1) or (2), since 
they could not exclude the possibility that (1) and (2) could be both correct. However, 
this does not affect EPR's conclusion. It was enough for them to show that (1) and (2) 
could not both be wrong, and therefore if one can find a reason for (2) to be false, (1) 
must be truC- 

The next section in EPR's paper intends to find a reason for (2) to be false, that is, 
to find a circumstance in which one can say that there are simultaneous elements of 
reality associated to two non-commuting operators. They consider a composite system 
composed of two spatially separated subsystems Sa and Sb which are prepared, by way 
of a suitable initial interaction, in an entangled state of the type 

1^) = Xl^"!^")^® (3-1) 

n 

where the \ipn)A denote a basis of eigenstates of an operator, say Oi, of subsystem Sa 
and \un)B denote some (normalised but not necessarily orthogonal) states of Sb- If one 



^Brassard and Methot's conclusion that the EPR argument is logically unsound is not based on 
this mistake, which they acknowledge as irrelevant. Their conclusion is based on a misinterpretation 
of EPR's paper. They read the quote "In quantum mechanics it is usually assumed that the wave 
function does contain a complete description of the physical reality [...]. We shall show however, that 
this assumption, together with the criterion of reality given above, leads to a contradiction", as stating 
that ^(1) A (2) false. If that was the correct formalisation of the argument I would agree with 
their conclusion. However, by "criterion of reality given above" EPR mean their sufficient condition 
of reality, not statement (2). 
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measures the quantity Oi at Sa, and obtains an outcome corresponding to eigenstate 
\il^k)A the global state is reduced to \ipk)A ® \uk)B- If, on the other hand, one chooses 
to measure a non-commuting observable O2, with eigenstates one should instead 

use the expansion 

I*) = $^c;I0.)a®|^.)b, (3.2) 

s 

where \vs)b represent another set of normalised states oi Sb- Now if the outcome of this 
measurement is, say, the one corresponding to \4>r)A, the global state is thereby reduced 
to |0r)yi® \vr)B- Therefore, "as a consequence of two different measurements performed 
upon the first system, the second system may be left in states with two different wave 
functions". This is just what Schrodinger later termed steering, and we'll return to that 
later. Now enters the crucial assumption of locality. 



EPR's locality assumption: "Since at the time of measurement the two 
systems no longer interact, no real change can take place in the second 
svstem i n consequence of anything that may be done to the first system." 
EPR35| 



"Thus", conclude EPR, "it is possible to assign two different wave functions to the same 
reality" . They now consider a specific example where those different wave functions are 
eigenstates of two non-commuting operators. If the initial state was of type 

/oo 
^i^APlh^-ixsp/h^^^ (3.3) 
-00 

then if one measures momentum at Sa and finds outcome p, the reduced state of 
subsystem Ss will be the one associated with outcome —p of . On the other hand, if 
one measures position and finds outcome x, the reduced state of Sb will be the one 
corresponding to outcome x of x^ . By measuring position or momentum at Sa, one 
can predict with certainty the outcome of the same measurement on Sb- But p^ and 
x^ correspond to non-commuting operators. EPR conclude from this that 



"In accordance with our criterion of reality, in the first case we must consider 
the quantity [p^] as being an element of reality, in the second case the 
quantity [x^] is an element of reality. But, as we have seen, bo th wave 
functions [corresponding to —p and x] belong to the same reality." |EPR35| 



In other words, by using the sufficient condition for reality, the assumption of locality 
and the predictions for the entangled state under consideration, EPR conclude that 
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Alice Bob 



, Pa , Pb 

Figure 3.1: The EPR scenario. Alice and Bob are two spatially separated observers 
who can perform one of two (position or momentum) measurements available to each 
of them. 

there must be elements of reality associated to a pair of non-commuting operators. So 
the (2) horn of EPR's dilemma proved before is closed, leaving as the only alterna- 
tive option (1), namely, that the quantum mechanical description of physical reality is 
incomplete. 

In hindsight, as we now know that the premise of locality is not entirely justified, we 
can read EPR's argument as demonstrating the incompatibility between the premises of 
locality, the completeness of Quantum Mechanics and some of its predictions. However, 
one could block the conclusion of the argument by rejecting those statistical predictions 
required to formulate the argument. This move is particularly easy to be made since 
the necessary predictions are of perfect correlations, unobtainable in practice due to 
unavoidable inefficiency in preparation and detection of r eal phy sical systems. This 
problem was considered by W. H. Furry already in 1936 |Fur36l | but experimentally 
useful criteria for the EPR paradox were only proposed in 1989 by Margaret Reid 



Rei89| 



3.2 The EPR-Reid criterion 



The essential difference in the derivation of the EPR-Reid criteria |Rei89l | and the 
original EPR argument is in a modification of the sufficient condition for reality. This 
could be stated as the following: 



Reid's sufficient condition of reality: If, without in any way disturbing 
a system, we can predict with some specified uncertainty the value of 
a physical quantity, then there exists a probabilistic element of physical 
reality which determines this physical quantity with at most that specific 
uncertainty. 

The scenario considered is the same as the one for the EPR paradox above, as depicted 
in Fig. 13.11 but one does not need a state which predicts the perfect correlations 
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considered by EPR. Instead, the two experimenters, Alice and Bob, can measure the 
conditional probabilities of Bob finding outcome in a measurement of given 
that Alice finds outcome xa in a measurement of xa, i.e., P{xb\xa)- Similarly they 
can measure the conditional probabilities P{pb\pa) and the unconditional probabilities 
P{xa), P{pa)- We denote by A^(x'^|x^), A^(p^|p^) the variances of the conditional 
distributions P{xb\xa), P{pb\pa), respectively. Reid now defines the average inference 
variances 

AL/(P^|P^) = $^P(/)A2(P^|/). (3.4) 

pA 

The notation Af^jr{x^\x^) is to indicate the average inference variance of x^ given that 
Alice measures x^ and similarly for p^ . Reid argues, by use of the sufficient condition 
of reality above, that since Alice can, by measuring either position x'^or momentum 
p^ , infer with some uncertainty Am/(x^|x'^) or Ainf{p^\p'^) the outcomes of the cor- 
responding experiments performed by Bob, and since by the locality condition of EPR 
her choice cannot affect the elements of reality of Bob, then there must be simulta- 
neous probabilistic elements of reality which determine x^ and p^ with at most those 
uncertainties. Now by Heisenberg's Uncertainty Principle (HUP), quantum mechanics 
imposes a limit to the precision with which one can assign values to observables cor- 
responding to non-commuting operators x and p. In appropriately rescaled units the 
relevant HUP reads AxAp > 1. Therefore, if quantum mechanics is complete as defined 
by EPR and the locality condition holds, by use of the adapted sufficient condition of 
reality, the limit with which one could determine the average inference variances above 
is 

A„^(x^|x^)A,„^(p^|p^) > 1. (3.5) 

This is the EPR-Reid criterion. Violation of that criterion signifies the EP R paradox . 
It has been used in experim e ntal demo nstration with continuous- variables 
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of the position and momentum observables. 



3.3 Formalising EPR 

I will now propose a mathematical formalisation of the premises of the EPR argument 
following the formalism of Chapter [2l 

It is evident that EPR had well in mind the basic Axioms of the MRRF defined on 
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page [TOl We have already argued in 13.11 that the first of EPR's desiderata, "Is the 
theory correct?", is taken care of automatically by the definition of a 'model', since it 
is required that it correctly predicts the phenomenon under study. So we can now use 
the ontological model to formalise the rest of their premises. 

3.3.1 The original argument 

Let us first understand what EPR's sufficient condition of reality amounts to. It is a 
criterion which, when satisfied, assigns a physical variable to the set of variables which 
have an "element of reality" associated to them. Let us denote this set by ETZ. The 
setup of the ontological model is essentially the same as that considered by EPR, so 
by EPR's sufficient condition of reality, if after the measurement of an observable a in 
which outcome A is obtained, one deduces that the probability of now obtaining B in 
a measurement of h is unity, then there's an element of reality associated to h. 

Definition 1 (EPR's sufficient condition of reality) // the outcome of 
measurement h is predictable given the outcome of measurement a, then 
there's an element of reality associated to h, i.e., 

P{B\A,a,h,c)e{Q,l} ^heSn. (3.6) 

The necessary condition for completeness is not so transparent. It is a priori not obvious 
what EPR had in mind by "a counterpart in the physical theory". However, we can 
extract the meaning by looking at when they actually use that condition. It is just 
in the justification of what I've called EPR's dilemma, already mentioned in Section 
13.11 on page [39l "If then the wave function provided such a complete description of 
reality it would contain these values; these would then he predictable" [my emphasis]. 
In other words, it is necessary, for EPR, that the variables corresponding to elements 
of reality be predictable by the wave function if it provides a complete description of 
reality. The wave function is fully specified by the preparation procedure, so denoting, 
following EPR, C{OQT) = Hhe quantum-mechanical description of reality given by the 
wave function is complete" we obtain 

Definition 2 (EPR's necessary condition for completeness) // the 

quantum mechanical description of reality given by the wave function is 
complete, then if there's an element of reality associated to h, the value 
Bofh given only the preparation procedure must be predictable, i. e. 

C{OQT) ^{beSn^ P{B\b, c) G {0, 1}). (3.7) 
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We can now derive EPR's dilemma. Assume C{OQT) and take two non-commuting 
observables b and b' . If there's an element of reality associated with both of them, 
then by Definition 2 they must both be predictable. But we know by the HUP that 
no quantum model involving non-commuting operators is predictable. Therefore there 
cannot be elements of reality associated to both of two non-commuting operators. De- 
note this last statement NC. Then C{OQT) — > NC, which is logically equivalent to 
^C{OQT) V NC . EPR's dilemma can now be stated as 

Theorem 1 (EPR's dilemma) // the quantum mechanical description 
of reality given by the wave function is complete, then there cannot be 
elements of reality associated to both of two non-commuting operators. 

Now enters the example of the entangled state. With that state, one can find observables 
b, b', a, a' such that P{B\A,a,b,c) e {0,1} and that P{B'\A',a',b',c) e {0,1}. Let us 
denote this observation ENT. Given ENT and Definition 1, there must be elements of 
reality associated to both b and 6', i.e., 6, b' G £71. This is where the locality assumption 
enters the reasoning in a fundamental way, by requiring that the elements of reality at B 
do not depend on the choice of experiment at A. Definition 1 used that implicitly by not 
mentioning that one should actually carry out the measurements under consideration, 
only that one would be able to predict the outcome of 6 (or b') if one decided to measure 
a (or a'). Close to the end of their paper, EPR remark that 

"One could object to this conclusion on the grounds that our criterion of 
reality is not sufficiently restrictive. Indeed, one would not arrive at our con- 
clusion if one insisted that two or more physical quantities can be regarded 
as simultaneous elements of reality only when they can be simultaneously 
measured or predicted. [...] This makes the reality of P and X depend upon 
the process of measurement carried out on the first system, which does not 
disturb the second system in any way. No reasonable definition of reality 



could be expected to permit this." |EPR35| 



Therefore given the observation of ENT and Definition 1, there must be elements of 
reality associated to both non-commuting operators, and the consequent of Theorem 1 
is false. Therefore the antecedent must be false, and EPR conclude, quantum mechanics 
is incomplete. 



Theorem 2 (the EPR argument) The sufficient condition of reality and 
some predictions of quantum mechanics imply that the quantum mechan- 
ical description of reality is incomplete. 
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3.3.2 General formalisation 

We can understand what kind of locality EPR had in mind if we revisit their condition 
for completeness. In Definition 2 we see that if there's an element of reality associated 
to b, and quantum mechanics is complete, then the outcome of b given the quantum 
state must be predictable. They implicitly mean that if there's an element of reality 
associated to a variable, then if one had the knowledge of enough facts about the 
system one would be able to predict the outcome of a measurement of that observable 
with certainty. In the language of the ontological model of Chapter [2l this means that 
there must be a deterministic hidden variable model for the outcome of b. With this 
consideration Definition 1 implies (I remind the reader that the expressions are to be 
understood as valid for all values of the variables, including A) 

P{B\A, a,b,c) e {0,1} ^b e Sn^ P{B\b, c. A) G {0, 1}, (3.8) 

Now it is obvious that 

PiB\A, a, 6, c) e {0, 1} ^ PiB\A, a, 6, c, A) G {0, 1}, (3.9) 

and to obtain (13.81) from (13.91) we just need the assumption of local causality as defined 
in Chapter [2], which we reproduce here for completeness: 

Corollary 1: A model is locally causal, i.e., a model satisfies local 
causality (LC) iff 

PiB\A, a, 6, c. A) = P{B\b, c. A), (3.10) 
plus the corresponding equations for A. 

So the locality concept that goes into Definition 1 is that of local causality. Remember 
that this implies that the joint probabilities factorise, i.e., that any model that satisfies 
local causality must have the form 

f{A, B\a, b,c) = J2 PiMc) PiA\a, c, X)P{B\b, c. A). (3.11) 

AeA 

What about completeness? If Orthodox Quantum Theory is complete, then for pure 
states it is essentially the assumption that there are no hidden variables. No possible 
new information could change the probability assigned to the outcomes of measure- 
ments. Since hidden variables could be unknowable by any observer even in principle, 
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such as is the case in Bohmian mechanics, a more correct statement would be that no 
actual variables exist in the past light cones of the measurements under study such 
that, conditionalised on those variables, the probabilities of the outcomes of said mea- 
surements would be further specified. Formally, that means that if OQT is complete, 
then the probabilities of Bob's outcome, say, can only be those allowed by quantum 
states Pq(5|6, c. A) = TrpBpfJ. I use the subscript Q to indicate quantum prob- 
abilities, where IIb is a projector onto the subspace corresponding to outcome B of 
observable h while p^^^ is a positive unit-trace operator for subsystem B associated with 
the preparation c and with the unknown variable This is a strong constraint, since 
by Heisenberg's Uncertainty Principle, not all probabilities are allowed to be associated 
simultaneously to two non-commuting operators h and h' . By those considerations, we 
arrive at the first main result of this Chapter. 



Proposition 1: The conjunction of local causality and completeness implies 
/{A, B\a, b,c) = J2 PiMc) PqiAla, c, \)PQ{B\b, c. A). (3.12) 



AeA 



This is equivalent to the statement that there exists a separable (i.e., non-ent angled) 



state which correctly describes the phenomenon. An entangled, or non-separable |Wer89 | 
quantum state p for two subsystems A and B, is one which cannot be written as a convex 
combination of product states, i.e., as 

i 

where the r]i are probabilities, i.e., they are real non-negative numbers and YliVi = 1; 
and pf and pf are quantum states for subsystems A and B respectively. The conse- 
quence has not been widely recognised before and deserves to be emphasised. 



Any proof that a phenomenon cannot be described by a separable 
state is proof that the phenomenon violates the conjunction of 
local causality and completeness of operational quantum theory. 



That is, the conjunction of local causality and completeness of quantum theory, the 
basic premises of the EPR argument, can be shown to be inconsistent with observation 

^Remember that A are not necessarily "hidden" variables in the sense of being fundamentally 
unknowable, but rather are variables which are ignored by the preparation procedure. They may 
be actually unknowable, such as the hidden variables in Bohmian mechanics, but they could be just 
ignored but knowable variables. For example, if one prepares a mixed state, there could be in principle 
knowable further variables that would specify a pure quantum state, and those are included in A. 
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Figure 3.2: The hierarchy of locality models. The diagram depicts the 'state space', 
that is, the space of possible quantum states. If a certain state is separable then all 
measurements that can be performed on it have a LHS model and if it has a LHS model 
then it will have a LHV model, but the converse implications do not hold. 

without the need for EPR 's sufficient condition of reality. The reason is that (13.121) is 
the most general model which is compatible with both of these premises, allowing for 
the most general distribution of local elements of reality which are compatible with the 
assumption that operational quantum theory is complete. 

However, the EPR argument was asymmetric in that it considered the effect of knowl- 
edge of the outcomes of measurements on one side of the apparatus, say Alice's, on 
the state on the other side, say Bob's. Schrodinger called this effect steering, and that 
arguably reflects the spirit of the EPR paradox more closely than the general nonsepa- 
rability of (13.121) . We will therefore use the term EPR paradox to refer to this particular 
kind of violation of local causality and completeness, and not the general violation of 
[461 This is the topic of the next subsection. 



3.3.3 The connection with Steering 

Suppose we assume completeness of OQT for Bob's subsystem, but do not restrict the 
hidden variables in Alice's side in any way. That implies 

Proposition 2: The conjunction of local causality and completeness for Bob's subsys- 
tem implies 

/(A B\a, b,c) = J2 PiMc) PiA\a, c, \)PQ{B\b, c. A). (3.13) 

AeA 



This was termed a Local Hidden State (LHS) model by Wiseman and co-authors |WJD07l | 
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and was proposed as a formalisation of the concept of steering, which can best be de- 
scribed in the quantum information fashion, that task. 

The task is the following. Alice wants to convince Bob that she can act on his state, or 
steer it, at a distance. If Bob does not trust Alice and believes he has a local quantum 
state, he will believe that the measurement statistics of their experimental outcomes 
will be given by f l3.13p . since that is the most general way in which his local quantum 
state can be classically correlated with Alice's measurement outcomes. Therefore if 
by looking at Alice's and his own joint measurement probabilities. Bob cannot find a 
model of form (13.13p . he'll be convinced that Alice's choice of experiment can somehow 
steer his own state. 

In that letter, Wiseman et al. answered a few questions about steering. They have 
demonstrated that, as applied to states, the concept of Bell non-locality, or the inex- 
istence of a Local Hidden Variable (LHV) model, is strictly stronger than steerability 
and steerability is strictly stronger than non-separability, that is, that if a quantum 
state demonstrates Bell non-locality then it necessarily demonstrates steering but not 
vice- versa, and if it demonstrates steering it necessarily demonstrates non-separability, 
but not vice-versa. This hierarchy is schematised in Fig. 13. 2[ 

Here we will be interested in a different question. Our purpose is to experimentally 
demonstrate steering, which by the preceding arguments represent a demonstration of 
the EPR paradox. When do the experimental outcomes violate (13.13p ? We want to find 
clear experimental signatures, in the form of inequalities, which, when violated, imply 
steering or the EPR paradox. We'll first show that the LHS model directly imply the 
EPR-Reid criterion for continuous-variables, and then we'll derive new criteria for the 
EPR-Bohm paradox. 

3.3.4 The EPR-Reid criterion re-derived 

It is easy to show that for the original EPR state, the measurement outcomes cannot 
be described in the form (13.131) . To see that, note that for all A, Alice can choose to 
measure a or a' and thereby predict with certainty either b or b' . But the value of 
A is independent of Alice's choice. So for all A, the outcomes of b and b' must be 
determined, that is, it must be simultaneously the case that PQ{B\b,c, \) G {0, 1} and 
PQ{B\b',c,X) G {0, 1}. But no quantum state allows that probability assignment, and 
therefore the LHS model cannot describe the joint statistics. 

But in the laboratory there are no such perfect correlations. What about the more 
general case of imperfect detection and preparation efficiencies? And what about other 
states which do not predict such correlations even with perfect efficiencies? 
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Recall that we defined, for the situation considered in Section [X2l average inference vari- 
ances as A^^/x^la;^) = J2.a P{x^)A\x''\x^), Af„/p^|]3^) = P(p^) A2(p^|p^). 
Consider the LHS model of f l3.13p . which applied to the situation at hand reads (omit- 
ting henceforth unnecessary notation) 

P(a;^,a;^) = 5^ P(A)P(x^|A)Pq(x^|A). (3.14) 

AeA 

The conditional probability distribution of given an outcome x^ of x^ is then 

P(x''|x^)^^ ™;'l'' p«(x^|A). (3.15) 

It is a general result that if a probability distribution can be written in the form of a 
convex combination of further normalised probability distributions, 

P(z) = 5^P(z)P(^|z), (3.16) 

i 

then the variance of the resulting combination is larger than or equal to the combination 
of the variances, i.e., 

A^z > ^P(z)A2(z|z). (3.17) 

i 

Eq. (13.131) has the form of (I3.16p . so the variance A^(x^|x^) of the conditional distri- 
bution (|3.15l) must satisfy the inequality 

^.,,B|^., > E ''"j>gr)''" A|(x«|A). (3.18) 



where Aq(x^|A) represents the variance of Pq(x^|A). Substituting fl3.18p into the defi- 

' A 



nition of Aj„f(x^\x^) we obtain 



= ^P(A)A|(a;^|A). (3.19) 



A similar procedure produces the equivalent inequality for A^^j(p'^|p^). We now define 
two vectors 

u = (v/p(AOAQ(a;^|Ai), v/p(A^AQ(a;^|A2),...) 

V = (V'P(AOAq(p^|Ai), v/P(A^Aq(p^|A2),...), (3.20) 
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and note that P(A)Aq(x-^|A) = j-up and similarly for and v. By use of the 
Cauchy-Schwarz inequality, i.e., \u\\v\ > u ■ v we obtain 

Al,{x'\x^)Alfip^\p^) > 5^P(A)A^(x^|A)5^P(A0A^(p^|A') 

A A' 
I |2| |2 \ / \2 

= \u\ \v\ > [U ■ V) 

= • (3-21) 

Now remember that Aq(x^|A) and Aq(p^|A) represent standard deviations over the 
same quantum state pf . Therefore they must obey the HUP 

Aq(x^|A)Aq(p^|A) > 1, (3.22) 

and by substituting (13.221 ) in (13.211 ) we finally arrive at the EPR-Reid criterion 

A,„^(x^|x^)A,„^(p^|p^) > 1. (3.23) 

So we see that we can derive the EPR-Reid criterion directly from the premises of local 
causality and completeness, which imply the LHS model. No additional condition of 
reality is necessary. 



3.4 Criteria for the EPR-Bohm paradox 

In Bohm's EPR paradox, spin measurements and are performed, simultaneously, 
on two spatially separated subsystems, A and B. For the quantum states 

1^.) = E ^Uj. -m)B (3.24) 

m=-j 

there is a maximum correlation between the results if the same spin component is 
measured at each location. Here \j, m)A/B are the eigenstates of and Jz respectively, 
for A/B. Opposite outcomes are predicted for andO Jq , so that the outcome of 
measurement of any one of J^, J^, Jf can be predicted, with absolute certainty, by 



X 5 "y 

measurement of one of the J^, Jy^ J^. Bohm's original variant |Boh5l| of the EPR 
argument considered only the Bell-state |^i/2)- 

The EPR-Bohm argument follows analogously to the EPR argument analysed before 
in this Chapter, except that instead of being based in the uncertainty principle for 
position-momentum, it is based on that for the spin observables, as depicted in Fig. 
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Alice Bob 

• * " • 

Xa , Ya , Xg , Yb, Zb 

Figure 3.3: The EPR-Bohm scenario. Alice and Bob are two spatially separated ob- 
servers who can perform one of three spin measurements available to each of them. 

m 

One of the uncertainty relations that impose lower bounds on the uncertainties for spin 
observables is of the form 

AjfAjf >|(jf)|/2. (3.25) 
The lower case je indicate the outcomes of measurements Jg. 

To derive experimental criteria for the EPR-Bohm paradox, we define the measurable 
averag e inference variance in the prediction of J?, based on measurement at A 



Rei89|. 



ALj? = ((j? - J^ie,estr) > E Pij^)^HjS\j^) = ^IfUW) (3-26) 

Here is an inferred estimate for , given an outcome for J^, and the average 

is over all outcomes , jf. The inequality follows because, for a given jf, the estimate 
that minimises {{j^ ~ 3!^\e est)"^) mean of P(j^|j^). This minimum is optimal, but 

not always accessible, in EPR experiments. From now on we will derive everything for 
the A^^j(j^| J_^), but one should keep in mind that one can measure the less optimal 
but possibly more accessible A^^J^ , modifying the inequalities in an obvious way. 

We will use a technique similar to that used to re-derive the EPR-Reid criterion in the 
previous section. We first assume that the measurement statistics can be described by 
a LHS model, which implies 

^IAjW) >J2Pi^)^U3F\>^)^ (3-27) 

A 

for all 9, 0, by an entirely analogous reasoning as in the derivation of f l3.19p . 
We now again define two vectors 



u = (v/p(AOAQ(jf|Ai), v/p(A^AQ(jf|A2),...) 

V = (v/i^AQ(jf|Ai), v/P(A^Aq(j,^|A2),...), (3.28) 
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and by using the Cauchy-Schwarz inequahty 

^lfij^\jMlfijy\Jy) > J2P(^^^Uj^\^^J2P(^'^^Ujy\^') (3-29) 



> 



A 

But in this case the relevant uncertainty relation is 

AQ(jf|A)AQ(jf|A)>i|(jf|A)|, (3.30) 

where (j^|A) represents the average of over the quantum state pf . By substituting 
( ICTll in (13:2911 

A,„Xjf |4^)A,„Xjf |J^^) > i$^P(A)|(jf|A)|. (3.31) 

A 

Now note that further specifying the hidden variables cannot decrease the average of 
the modulus of the mean, so that 

and finally we arrive at the following theorem. 

Theorem 3: The following inequality if violated for spatially separated sys- 
tems A and B would demonstrate Bohm's EPR paradox 

A.„/(jf |jf)A,„K^ 1^,^) > Y.P^3tW'^\3t)\- (3.33) 



We reiterate that the choice of measuring J^, for example, to infer was purely 
conventional. It will be the best choice in a state of type f l3.24p . but nothing special 
hangs on which particular quantum observable actually corresponds to what we called 
J^, Jy J^. In a practical situation one should choose whichever observables optimise 
the violation of f l3.33p . The observables at B, however, must actually correspond to 
three mutually orthogonal directions, as the uncertainty principle used is only valid in 
that situation. 

We point out that, from ( 1X321) . A,„ f { \ ) A,;„, /• ( | ) < |(jf)|/2 also signifies the 



EPR paradox, as shown by Bowen e t al |BSBL02| . who demonstrated this inequality 



experimentally for Stokes operators |Kor07l | that represent the polarisation of optical 



fields. For the states (13.24p . (jf ) = and this latter form is not useful. 
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Alice Bob 

• < > o 

Figure 3.4: The extended EPR-Bohm scenario. Alice and Bob are two spatially sepa- 
rated observers who can perform one of four measurements (one of three "spin" com- 
ponents or a total number measurement) available to each of them. 

The states (13.24p violate ( 1 3.331) and can be investigated experimentally using parametric 
down conversion 



LLHBOll |. We re-express f l3.24p using Schwinger's formalism 



1^.) = ^^_J^ {alhl-aH\r\Q), (3.34) 

where a± are boson operators for orthogonally-polarised field modes of A, a similar set 
is defined for B, j = N/2 (where here j represents the eigenvalues of as in state 
f l3.24p ) and |0) is the multi-mode vacuum. We define the Schwinger spin operators 

= - (^a^a\_ + aLd+ 
jf = — (a^a\ — ala^ 



N 



A 



ci+a+ + aLa_^ . (3.35) 



The situation of the EPR-Bohm setup is therefore extended with number measurements, 
as in Fig. 13. 4[ 

Fields a± and b± are spatially separated, and impinge on polarising beam splitters to 
enable a measure of the photon number differences, and , the spin measurements 
J^/yj Jx/y being made with phase shifts that immediately precede the polariser. 

A case of experimental interest is j = 1/2, the Bell state, but with detection efficiency 
?7, so that there is a non-zero probability (1 — 77) of photons not being detected. The 
outcome of no detection corresponds to for both d!^a+ and a^a_ and hence for 
(similarly for J^). Calculation reveals 

^IfUxlJx) = ^Ifijyljy) = riil - r/^)/4 (3.36) 

and 

E^(^'^^)l(^'i'l^'^^)l=^V2, (3.37) 
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so the parado xical co rrelations are obtained where r] > 0.62. This is consistent with the 
Werner state 



Wer89| 



P.) - + P. 1^1/2) (^1/2 I 



(3.38) 



(I is the maximally mixed density operator) which yields the EPR paradox when ps > 
0.62. Entanglement occu rs when, and only when, ps > 0.33, so the EPR paradox is 
more difficult to confirm 



BSLR03 



WJD07|. 



The states (13.341) can be generated using two parametric amplifiers |CS78l | as modelled 
by the interaction Hamiltonian 



H = ihK{a\.bl_ — aLb\) — ihK{a+h- — a-b^ 



(3.39) 



Assuming the initial state to be a vacuum, the solution after a time t is a superposition 
of the (I3.34p . The predictions of a particular \ipj) could be tested by restricting to the 
ensemble with a fixed . Most interesting is the limit of large (N^). We therefore 
propose to detect the Bohm-EPR correlation using the full solution of H. For macro- 
scopic systems, measurement of all the conditional probabilities can be difficult. We 
present an alternative Bohm-EPR criterion. 

Theorem 4: Violation of the following inequality for spatially separated A 
and B reveals an EPR paradox 



Ajk/(jf 1^) + ^infUyVJ^) + AL/(jf I ) > 



iB\ jB\ 



(3.40) 



The proof is analogous to that of (13.331) . except that we follow 
a quantum uncertainty relation 



Tot04 



HT03| to write 



A'j^ + A^j;f + A^jf > A^iV^/4 + (iV^)/2. 



(3.41) 



Inequality (IXiOD is tested for IK^. wit h detection efficiency r]. The A^^j'f of (13:261) 
is defined for the linear estimate |Rei8d | jg^gf = gj^, where g = —{jgjg')/{jg'jg') to 
minimise A^^^j^, giving solutions 

^Itji = (UF?) - {jFje)V{Uff) = VS^nhMl-v' + M^-v)stnh'r)/2{l + r]stnh\) 

(3.42) 

and 

(A^^) = 2risinh^r, (3.43) 

where r = \K\t. Fig. 13.51 plots the minimum efficiency 77 required for violation of (13.401) . 
to indicate a test of macroscopic EPR for large (A^^) and rj > 0.66. 



3.4 Criteria for the EPR-Bohm paradox 



55 



mm 




Figure 3.5: Minimum detection efficiency for violation of (13.401) by f l3.39p versus mean 
photon number. 



I should clarify that inequalities f l3.33p and f l3.40p of Theorems 3 and 4 are both com- 
pletely general and valid for arbitrary detector efficiencies, and the explicit introduction 
of the efficiencies rj in the above discussions are to model what would happen in an exper- 
imental situation. All that one needs for their experimental evaluation are the inferred 
variances (or their estimates), defined in fl3.26p . and either the probability distributions 
P{jf) and conditional averages for (13:331) or the average (A^^) and variance 

A'^N^ for ( 13.40p . For the latter, a further clarification is needed that the inequality is 
derived for arbitrary number distributions. 



The existence of macroscopic quantum superpositions |CR06l | are implied by violation of 
the EPR-Bohm inequality where (A^^) is largqj. Measurement of a tiny A^^jf implies 



the existence of large superpositions of the eigenstates | j^) of (or J^ ), superpositions 



tB 



^Following |CR06l | and the formalism to be presented in Chapter HJ we suppose p to be expressible 
as a mixture of microscopic superpositions of and Jy only, so p = J^i^iPi — J2i ■Pi\''l'i,s){'4^i,s\, 
where each \tpi,s) is a superposition of eigenstates of separated by no more than S. For such 
Pi there is the constraint A^j^ < <S'^/4; similarly A^jy < 5^/4. For each pi (being a quantum state) 
^Infjf + ^^jy + ^^jx > {N^) /2 (see next paragraph), so Af^fjf > {N^) /2 - This inequality 

must also hold for the mixture p. 

To prove the uncertainty relation above, define the reduced state p^A of B given result for J^; its 
variance for Jf is A^J^lj^). Now p^ - Tr^p = Pij^)pfA so jf/^ > P(j,^)A2( jf/Jj'f ), 
and since p^A is a quantum state satisfying A2jf + A Vjf + A > (A^-^)/2, we get Af^^J^ + A'^J^ + 
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Figure 3.6: Minimum size 5* of superpositions that may be inferred from violation of 
( I3.44p with the Hamiltonian of ( 13.39p versus mean photon number. 

with a range in of at least 

S = y/(iV^)-2ALj?, (3.44) 

which becomes ~ ^/vi^^) foi^ (I3.39p with (A^^) large (Fig. 13.61) . I won't attempt to 
explain this point in much detail as it will be the subject of Chapter O 



Chapter 4 

Continuous- Variables Bell inequalities 



As we have seen in Chapter [3l Einstein, Podolsky and Rosen (EPR), in their famous 
1935 paper |EPR35| . demonstrated the incompatibility between the premises of 'local 
realism 'u and the completeness of quantum mechanics. The original EPR paper used 
continuous position and momentum variables, and relied on the ir commutation rela- 
tions, via the corresponding uncertainty principle. Bohm |Boh5ll | introduced, in 1951, 



' the E PR paradox with spin observables. This was the version that was 
to prove his famous theorem showing that quantum mechanics pre- 



his version o 
used by Bell 

diets results which can rule out the whole class of local hidden variable (LHV) theories. 
It is hard to overemphasise the importan ce of th is result, which has even been called 



"the most profound discovery of science" [StazJ. However, the original Bell inequal 



ity, and all of its generalisations, are directly applicable only to the case of discrete 
observables. The main purpose of this chapter is to close the circle and derive a class 
of Bell-type inequalities applicable to continuous-variables (CV) correlations, together 
with multipartite generalisations. 



4.1 Motivation 

The last decades have witnessed the birth of whole new areas of enquiry, which aim 
to harness these non-classical correlations predicted by Quantum Mechanics towards 
information-processing applications . Bell inequality violations have been show n to be 



soni 



AGMSn4 



BHKn5 



relevant for quantum teleportatio n |Pop94 lHHH96l |. quantum key distribution 



AGMQ6|, and reduction of communication complexity |BZPZ04| 



Eke91 



However there are still many unanswered questions. Bell inequality violation was once 



^We have already seen in Chapters[2]and[3]what this term means and what assumptions particularly 
are needed for the EPR argument. Having that in mind, I will keep using the vague but popular 
terminology "local realism" for simplicity. 
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thought to be equivalent to entanglement, until Werner |Wer89l | showed that th ere are 
entangled states that do not violate a large class of Bell inequ alities. G isin |Gis91 | 
proved that all bipartite pure entangled states violate the CHSH |CHSH69l | ine quality , 
a result which was gener alised to the multipartite case by Popescu and Rohrlich |PR92 |. 



Later, Horodecki et al. 



HHH95| devised an analytic criterion to determine whether a 



mixed state of two qubits have a LHV description. Apart from this simple case, there 
is no known general method to decide whether a quantum state violates some Bell 
inequality. 

For n parties, m measurements per party and o outcomes, it is well-known that the 
set of correlations allowed by LHV theories can be represented as a convex polytope, 
a multi-dimensional geometrical structure formed by all convex combinations (linear 
combinations where the coefficients are probabilities, i.e., they are non-negative and 
sum to one) of a finite number of vertices. The vertices of this pol ytope are the classica l 



pure states — the states with well-defined values for all variables |Pit89l . lPer99l . lGis07l |. 
The tight Bell inequalities are associated with the linear facets of the polytope. It 
is a computationally hard problem to list all Bell inequalities for given {n,m,o), and 
full numerical characterisations have been accomplished only for small values of those 
parameters. 

Nevertheless, some special cases can be considered. One can derive classes of Bell 
inequalities which are recursively defined in terms of t hose pa rameters. There are classes 



Mer9n 



for (n, 2, 2) 
and ey en {n, 2, o) 



Ard92 



Cabnl 



BK9 



llwwnil |. 



SLKoJ, (n,m,2) 



Zukn 



Zuk9 



I 



ij2 



,2^ 



LPZBn4 



MerSO 



CGL+n2| 



NLPnd | and (2,m,o) 



CGOj|. However, no class of Bell inequalities has previously been derived without any 



reference to the number o of outcomes or to their bound. Any real experiment will 
always yield a finite number of outcomes but are there constraints imposed by LHV 
theories that are independent of any particular discretisation, and can be explicitly 
written even in the li mit o — » oo? This question goes beyond a challenge proposed in a 
recent paper by Gisin jGisO?! . where he proposes a list of open questions regarding Bell 
inequalities, which was (in other words) to find inequalities valid for arbitrary but fixed 
o.Our answer is yes; and the derivation is much more straightforward than in the case 
of the usual Bell-type inequalities which are restricted to a particular set of outputs. 

We derive a class of inequalities for local realism that directly uses correlations of 
measurements, with no restriction to spin measurements or discrete binning. They 
are not only valid for arbitrary but fixed o, but they do not mention the number of 
outcomes o in their derivation at all. The new inequalities are remarkably simple. They 
place no restriction on the number of possible outcomes, and the contrast between 
the classical and quantum bounds involves commutation relations in a central way. 



4.2 The variance inequaUty 
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They must be satisfied by any observations in a LHV theory, whether having discrete, 
continuous or unbounded outcomes. We can immediately re-derive previously known 
Bell-type inequalities, obtaining at the same time their quantum-mechanical bounds 
by considering the non-commutativity of the observables involved. We also display 



Mer9nl. 


Dru83. 


" 

Per99. 


ReiOl 



measurements, even in the macroscopic, large n limit 
show that the new Bell violations survive the effects of finite generation and detection 
efficiency. This is very surprising, in view of the ma ny examples in which decoherence 



rapidly destroys macroscopic superpositions |Zur03 |. 



Apart from this intrinsic interest, these inequalities are relevant to an important sci- 
entific problem. No experiment has yet produced a Bell inequality violation without 
introducing either locality or detection loopholes. As emphasised by Gisin in his re- 
cent article jGisO?! . "quantum nonlocality is so fundamental for our world view that 
it deserves to be tested in the most convincing way". One path towards this goal 
is to use continuous-variables and efficient homodyne detection, which allows much 
higher detection efficiency than is feasible with discrete spin or photo-detection mea- 
suremen ts. A number of loop-ho l e free proposals exist in the lite rature, but they all 
use Bell 



LV95 



GDR,98l . 



AMR,Sn2 



m 



WHG+n,l lGPFC+n4j or Hardy |YHS99| inequalities 



with a dichotomic binning of the r esults (which usua l ly lead t o small violations) , or else 
a parity or pseudo-spin approach 
with efficient homodyne detection. 



BW99 



CPHZ02 



IsBKOO l 



which cannot be realized 



4.2 The variance inequality 



We will focus on the correlation fimctions of observables for n sites or observers, each 
equipped with m possible apparatus settings to make their causally separated mea- 
surements. We consider any real, complex or vector function F (X, Y, Z, . . .) of local 
observations Xj, Yi, at each site i, which in an LHV theory are all functions of hidden 
variables A. In a real experiment the different terms in F may not all be measurable at 
once, because they may involve different choices of incompatible observables. The as- 
sumption of locality enters the reasoning by requiring that the local choice of observable 
does not affect the correlations between variables at different sites, and therefore that 
the averages are taken over the same hidden variable ensemble P (A) for all terms. We 
introduce averages over the LHV ensemble (there's no loss of generality in considering 
deterministic LHVs, since, according to Fine's Theorem 2.8, a phenomenon violates 
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local causality if and only if it violates local determinism |Fin82| ) 



{F)= / P(A)F(X(A),Y(A),Z(A),...)rfA. 



(4.1) 



Our LHV inequality uses the simple result that any function of random variables has a 
non-negative variance, 

(4.2) 



|(F)|^<(|F|^). 



We can also give a bound 



supi 



(4.3) 



where the subscript denotes the supremum (least upper bound), in which products of 
incompatible observables are replaced by their maximum achievable values. This is 
necessary since if we are not able to measure both X j and Y j simultaneously, a general 
LHV model could predict any achievable correlation [SU07|. 



The same variance inequality applies to the corresponding Hermitian operator F in 
quantum mechanics. While the observables at different sites commute — they can be 
simultaneously measured — those at the same site do not, so operator ordering must 
be included. This enables us to see how quantum theory can violate the variance bound 
for an LHV. 



4.3 An example 



As an example, we will apply this variance inequality to a well-known case. Consider 
two dichotomic observables Xi.Yi per site i, the ou tcomes o f which are ±1. We define 
Fi = Xi, F[ = Yi , and then inductively construct |GBP98l |: 



F„ = -(F„_i + F:_JX„ + -(F„_i - (4.4) 

where can be obtained from F„ by the exchange Xi < — > Yi. In calculating 
we'll keep track of the local commutators just to make the contrast with quantum 
mechanics clearer. For real random variables it is obvious that the commutators are 
zero; at a first read one can just regard that as the act of an overzealous classical 
theorist. For real variables X, Y, the commutator is defined in the same way as for the 
corresponding operators, i.e., [X, F] = XY — YX. The anti-commutator is defined by 



4.3 An example 
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[X, Y]+ = XY + YX. Then 
Fl = \{{Fl_^ + Ft^){Xl + Y^) 

(4.5) 



Since = = 1, we can show that = FJ^^ and 



n n 



Fn ~ Fn-l ^ [Fn-1, F^_-^'^ [X„, 1^] . 



(4.6) 



In a LHV theory, the term which involves commutators will be zero since [X(A), 1^(A)] = 
X(A)F(A) — F(A)X(A) = 0. Hence by induction = = 1 and the variance 
inequality (14. 2p becomes: 

- 1 < (Fn) < 1. (4.7) 



This is the Mermin-Ardehali-Belinskii-Klyshko (MABKjjMer90|, lArdQa . IbK93I | Bell 
inequality, which reduces to the well-known Bell-CHSH CHSH69| inequality for n = 2. 



4.3.1 Quantum bound 



We can now calculate the quantum mechanical bound by writing the variance inequality 
(14.21) and substituting the functions in (14.61) by their corresponding operators 



{Fn)Q < {Fl)Q = {Ft,--^[Fn^i,FU[Xn,Y,])Q 

< ||F^,||+i||[F„_i,F;_J||||[X„,i;]||, (4.8) 

where the norm ||y4|| denotes the modulus of the maximum value of {A)q over all quan- 
tum states. The norm of the second commutator has the bound || [X„, V^] || < 2. It's easy 
to show that = F^_^[XnX] + [Fn-i.K-i] and therefore ||[F„,,F^]|| < 2||F2_^||^ 

II [F„_i, . Solving the recursion relation by noting that ||F;f|| = |||[Xi, Yi]|| = 1 

we finally arrive at the bound 

{Fn)l < 2"-^ (4.9) 



This can be attained with the generalised GHZ states [GBP98|, which therefore violate 
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4.4 Continuous-variables inequalities 

Inspired by those results, we now demonstrate an LHV inequality that is directly appli- 
cable to unbounded continuous variables, in particular field quadrature operators. The 
choice of the function F„ in (14.41) is not optimal though, since the variance in general 
involves incompatible operator products that have no upper bound. 

To overcome this problem, consider a complex function C„ of the local real observables 
{Xk, Yk} defined as: 

n 

Cn = Xn + = \{{Xk + lYk) , (4.10) 

k=l 

SO that the modulus square only involves compatible operator products, i.e. 

n 

\Cn\' = \{{Xl + Y^). (4.11) 

k=X 

Applying the variance inequality to both X„ and Y^, we find that: 

n 

{X^?+{Y^?<([{{Xl + Yi)) (4.12) 
fc=i 

This is the main result of this chapter. Given the assumption of local hidden variables, 
this inequality must be satisfied for any set of observables X^, Ffc, regardless of their 
spectrum. 



4.4.1 Quantum violations 

The fact that we have neglected the commutators in deriving (|4.12l) hints that quantum 
mechanics might predict a violation. We define quadrature operators 

Xk = cike-'''' + die''" 

% = dke-'^'''+'^^/^'> + dle''-''+''^/^\ (4.13) 

where a^, a[ are the boson annihilation and creation operators at site k and G { — 1,1}. 
We now define the operator 

Z, = Xk + tYk (4.14) 

and note that it follows that Cn = Y[k=i ^k- The definition of Yk allows for the choice 
of the relative phase with respect to X^ to be ±7r/2. Depending on s^, for each k either 
Zk = 2dfce~*^*= or Zk = 2a|.e*^*. Denoting ^4^(1) = dk and Ak{—1) = d\, the term in the 
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LHS of (I4.12P in quantum mechanics is then 

\{IIz,)q\' = \2-e'^^^'''Hl[A,{s,))Q\'. (4.15) 

k k 

The RHS becomes 

n n 

([[{Xl + Y^))q = (HiAala, + 2))q (4.16) 

k=l k=l 

regardless of the phase choices. To violate (14.121) we must therefore find a state that 
satisfies 

\{\{Msk))J >{\{{ala, + \))^, (4.17) 

k k 

which is surprisingly insensitive to relative phases between the quadrature measure- 
ments at different sites. 

This violation of a continuous variable Bell inequality can be realised within quantum 
mechanics. Consider an even number of sites, choosing Sfc = 1 for the first half of them 
and Sfc = — 1 for the remaining. To maximise the LHS we need a superposition of terms 
which are coupled by that product of annihilation/creation operators. One choice is a 
state of type 



1^5) = Co |0, . . . , 0, 1, . . . , 1) + ci |1, . . . , 1, 0, . . . , 0) , (4.18) 

where in the first term the first n/2 modes are occupied by zero photons and the 
remaining by 1; conversely for the second term. With that choice of state the LHS of 
fl4.17p becomes |coP|cip, which is maximised by |coP = |cip = |. The RHS is (|)^(|)^ 
independently of the amplitudes cq, ci. Dividing the LHS by the RHS, inequality (I4.17p 
becomes | (|) ^ < 1, which is violated for n > 10, and the violation grows exponentially 
with the number of sites. 



4.4.2 Feasibility 

While setting up the homodyne detectors necessary for this observation is challenging, 
the complexity of this task scales linearly with the number of modes. A more stringent 
constraint is most likely in the state preparation, but we can relate state f l4.18p to a 
class of states of great experimental interest. They can be achieved from a generalised 
GHZ state oin/2 photons, 

|G/fZ(n)) = ^(|i/)®t + |y)®t), (4.19) 
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where \H) and \V) respectively represent single-particle states of horizontal and verti- 
cal polarisation — by splitting each mode with a polarising beam splitter. Therefore 
violation of f l4.12p can be observed in the ideal case with a 5-qubit photon polarisation 
GHZ state and homodyne detection. 

An intere sting question is the effect of decoherence, both from state preparation error 



JCKL06l | and de tector inefficiency. The usual Bell-CHSH violations have an efficiency 



threshold |GM87l | of 83%. This has not yet been achieved for single-ph oton counting. 
Homodyne detection is remarkably efficient by comparison ( |MAL"'"07l | report up to 
94.4%, for example). However, the effect of detector efficiency is easily included by 
assuming that each detected photon mode is preceded by a beam splitter with intensity 
transmission rj < 1. This changes both the LHS and RHS, so that the inequality 
becomes < 4^/", giving a threshold efficiency requirement of rj > rjmin, where 



rjrnin = (1 + v/l+4l-2/n)/4l-2/". 



(4.20) 



This reduces at large n to an asymptotic value of r]oo = 0.80902. Unexpectedly, the 
Bell violation (which signifies a quantum superposition) is less sensitive to detector 
inefficiency in the macroscopic, large n limit. The minimum detector efficiency rjn at 
finite n is plotted in Fig. 1, together with the minimum state preparation fidelity emin 
in the case of ideal detectors, where we model the density matrix as 

/) = e|^5)(*5| + (l-e)/. (4.21) 



4.5 No-go proof for first- moment correlation C.V. Bell 
inequalities 

We will finally prove that there are no LHV inequalities possible if one considers only 
the first-moment correlations between continuous variables in different sites. We will 
show this explicitly for the simplest case and indicate how to generalise to arbitrary 
numbers of parties and settings. Consider first n = 2 parties, Alice and Bob, each 
of which can choose between m = 2 observables: Xa, Ya for Alice and X^, YJ, for Bob. 
Each measurement yields an outcome in the real numbers. The first-moment correlation 
functions for each of the 4 possible configurations are just the averages (XaXb), {XaYb), 
{YaXh), {YoYh). Given those 4 experimental outcomes, can we find a local hidden 
variable model which reproduces them? 
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n 

Figure 4.1: Minimum state preparation fidelity e^m for ideal detectors (solid line), and 
minimum detection efficiency rjmm for ideal state preparation (dashed line) required for 
violation of (14.120 as a function of the number of modes. The asymptotic value of ?7mm 
is indicated by the dash-dotted line. 

We construct an explicit example. Consider a hidden-variable state S where the hidden 
variables are the measured values X, Y, in an equal mixture of four classical pure states 
Sk = {Xa, Ya, Xb, Ffc)^ defined by 

51 = 2(l,0, (X,X,),0) 

52 = 2(l,0,0,(X„n)) 

(4.22) 

S; = 2{0,l,{YaXb),0) 
S^ = 2 {0,1,0, {YaYb)). 

Each of the states Sk assigns a nonzero value to only one of the 4 correlation functions. 
Since the probability of each of the states in the equal mixture is 1/4, we have for 
example {XaXb)s = JE^{XaXt)s. = (X^X^). 

Satisfying the two-site correlations using the state S defined by (|4.22l) leaves us with 
uncontrolled values for the single-site correlations, for instance {Xh)s = j{{XaXh) + 
{YaXf,)). One might object to the fact that this is not equal to (Xj,) in general. However, 
we may correct these lower order correlations by adding four more states (5*5 to 5*8) and 
changing the prefactors multiplying Si to 5*4 to compensate for their reduced weight in 
the equal mixture. Crucially, adding these extra states to S in this manner does not 
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modify the values of correlations such as {XaXb). As an example, we exhibit the state 
^5 = 8 (0, 0, {Xt) - {{XaXh) + {YaX,))/V8, 0), (4.23) 
which corrects the single expectation value {Xb)s to {Xh). 

The proof generalises easily to arbitrary n and m. In that case, there are m" possible 
combinations of measurements which yield n-site correlations. Denoting the j^^ observ- 
able at site i by X- , each combination is specified by a sequence of indices {ji,j2, • • • , jn) ■ 
For each combination of measurements, we define a hidden variable state which assigns 
nonzero values only to the variables which appear in the associated correlation function 
{YYi=i^t)- III analogy to the example above, we can always choose the values of the 
hidden variables associated to X-' such that their product is equal to {YYi=i ^i') ■ 
Since all other m" — 1 states defined in this way will give a value of zero to this partic- 
ular correlation function, and given that the probability associated with each of those 
states is 1/m", we reproduce all correlations as desired. As indicated in the example, 
additional first moment correlations involving less than n sites can be included in the 
LHV model by adding additional states to 5 in a way which doesn't affect the n-site 
correlations. Thus, any possible observation of first moment correlations may be ex- 
plained using a LHV model, and hence these correlations alone cannot violate any Bell 
inequality. In other words, the minimum requirement for a correlation Bell inequality 
with continuous, unbounded variables, is to use not just the first but also the second 
moments at each site. 



4.6 Concluding remarks 

In conclusion, in this chapter we have derived a new class of Bell-type inequalities valid 
for continuous and unbounded experimental outcomes. We have shown that the same 
procedure allows one to derive the MABK class of Bell inequalities and their correspond- 
ing quantum bounds. That derivation makes it explicit that non-zero commutators — 
associated with the incompatibility of the local observables — are the essential ingre- 
dient responsible for the discrepancy between quantum mechanics and local hidden 
variable theories. The new Bell-type inequality derived here can be directly applied to 
continuous variables without the need for a specific binning of the measurement out- 
comes. Surprisingly, quantum mechanics predicts exponentially increasing violations 
of the inequality for macroscopically large numbers of sites, even including realistic 
decoherence effects like inefficient state preparation, and a detector loss at every site. 



Chapter 5 

Generalised Macroscopic 
Superpositions 



5.1 Introduction 



Since Schrodinger's seminal essay of 1935 |Sch35| . in which he introduced his famous 
cat paradox, there has been a great deal of interest and debate on the subject of the 
existence of a superposition of two macroscopically dis tinguis hable states. This issue is 
closely related to the so-called measurement problem |Zur9l| . Some attempts to solv e 



this problem, such as that of Ghirardi, Rimini, Weber and Pearle |GRW86l . 1GPR90|, 
introduce modified dynamics that cause a collapse of the wave function, effectively 
limiting the size of allowed superpositions. 

It thus becomes relevant to determine whether a sup erposition of states w ith a cer- 



tain leve 



BHD+96 



of distinguishabili t y can exist e xperimentallv |MSPB03l|. Eviden ce 



FPC+nn 



RBHOl 



MMKW96I . 



AMM+n 



DMSsnn 



BHU+n2 



OTBLOnal for quantum 



superpositions of two distinguishable states has been put forward for a range of different 
physical systems including SQUIDs, trapped ions, optical photons and photons in mi- 
crowave hi gh-Q ca vities. Signatures for the size of superpo sitions h ave been discussed 
by Leggett |Leg02| and, more recently, by Korsbakken et al |JIKC07l |. Theoretical work 
suggests that the generation of a superpositio n of two truly macroscopically distinct 
states will be greatly hindered by decoherence 



CL85 



ZurOaJ. 



In a recent paper [CR06|, we suggested to broaden the concept of detection of macro- 
scopic superpositions, by focusing on signatures that confirm, for some experimental 
instance, a failure of microscopic/mesoscopic superpositions to predict the measured 
statistics. This approach is applicable to a broader range of experimental situations 
based on macroscopic systems, where there would be a macroscopic range of outcomes 
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for some observable, but not nec essarily ju st two that are macroscopically distinct. 
Recent work by Marquardt et al |MAL"'"07l | reports experimental application of this 
approach. 



The paradigmatic example |Leg84 



BHU+n2 



AMM+na 



DMSSn 



3 



LG8n 



BHD+96 



MMKW96 



FPC+nn 



RBHni 



OTBLG06l | of a macroscopic superposition involves two 



states ip+ and macroscopically distinct in the sense that the respective outcomes 
of a measurement x fall into regions o f outco me domain, denoted + and — , that are 
macroscopically different. We argue in |CR06l | that a superposition of type 



(5.1) 



that involves a range of states but with only some pairs (in this case ?/;+and ^_) 
macroscopically distinct must also be considered a type of macroscopic superposition 
(we call these generalised macroscopic superpositions), in the sense that it displays 
a nonzero off-diagonal density matrix element (^+|p|^_), connecting two macroscop- 
ically distinct states, and hence cannot be constru cted from mic r oscopic superposi 



tions of the basis states of x. Such superpositions |Mer80l . lDru83l . 



Per92 



RMDMn2l | 



are predicted to be generat ed in cert a in key macroscopic e xperiments, that have con 
firmed continuous-variable 



,TDV+na 



,TDB+n4 



O 



LGK+n5 



'KP92 



zwL+nn 



GDH+nf 



d. IsYK+nd 



SLW+ni 



BSBLn2 



sspn2 



squeezing and entanglement of atomic ensembles 



squeezing and entanglement, spin 



JKPOl 



JSCiOj], and entanglement 



and vio 



DM98 



ations of Bell inequalities fo r discrete measurements on multi-photon systems 



LBS+04 



RRH+04 



LLHBOll. 



We derive criteria for the detection of the generalised macroscopic (or S'-scopic) su- 
perpositions using continuous variable measurements. These criteria confirm that a 
macroscopic system cannot be described as any mixture of only microscopic (or s-scopic, 
where s < S) quantum superpositions of eigenstates of x. We show how to apply the 
criteria to detect generalised S'-scopic superpositions in squeezed and entangled states 
that are of experimental interest. 

The generalised macrosc opic su perpositions still hold interest from the point of view of 
Schrodinger's discussion |Sch35| of the apparent incompatibility of quantum mechanics 
with macroscopic realism. This is so because such superpositions cannot be represented 
as a mixture of states which give outcomes for x that always correspond to one or other 
(or neither) of the macroscopically distinct regions + and — . The quantum mechanical 



paradoxes associated with t 
discussed in previous works 



le generalised macroscopic superpo sition (15.11) have been 



GR,n6 



Mer8 



3 



Dru83 



RC05 



CRQ7|. 



The criteria derived in this chapter take the form of inequalities. Their derivation 
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utilises the uncertainty principle and the assumption of certain types of mixtures. In 
this respect they are si milar to criteria for inseparability that have been de rived by 
Duan et al. and Simon 



DGCznn 



SimOOl | and Hofmann and Takeuchi HT03l |. Rather 



than testing for failure of separable states, however, they test for failure of a phase space 
"macroscopic separability", where it is assumed that a system is always in a mixture 
(never a superposition) of macroscopically separated states. 

We will note that one can be more gener al in t he derivation of the inequalities, adopt- 
ing the approach of Leggett and Garg |LG85l | to define a macroscopic reality with- 
out reference to any quantum concepts. One may consider a whole class of theories, 
which we refer to as the minimum uncertainty theories (MUT) and to which quan- 
tum mechanics belongs, for which the uncertainty relations hold and the inequalities 
therefore follow, based on this macroscopic reality. The experimental confirmation of 
violation of these inequalities will then lea d to dem onstration of a new type of Einstein- 
Podolsky- Rosen argument (or "paradox") [EPR35|, in which the inconsistency of a type 
of mac roscopic (.S-sc opic) reality with the completeness of quantum mechanics is re- 
vealed 



CRQ6 



RC05l |. A direct analogy exists with the original EPR argument, which 



is a demonstration of the incompatibility of local realism with the completeness of 

|. In our case, the inconsistency of the com- 



R,ein3 



Wisn6 



RDB- 



quantum mechanics 

pleteness of quantum mechanics is shown to be with a macroscopic reality rather than 
the local reality of the original EPR argument. 



5.2 Generalised S'-scopic Coherence 



We introduce in this Section the concept of a generalised S'-scopic coherence [CR06|, 
which we define in terms of failure of certain types of mixtures. In the next Section, we 
link this concept to that of the generalised S'-scopic superpositions (15. ip . 

We consider a system which is in a statistical mixture of two component states. For 
example, if one attributes probabilities pi and p2 to underlying quantum states pi and 
P2, respectively (where pi denotes a quantum density operator), then the state of the 
system will be described as a mixture, which in quantum mechanics is represented as 

P = pipi + P2p2- (5.2) 



This can be interpreted as "the state is either pi with probability pi, or p2 with 
probability ^2"- The probability for an outcome x of any measurable physical quantity 
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X can be written, for a mixture of the type fl5.2p . as 

P{x) = piPiix) + P2P2{X), (5.3) 

where Pi{x) (z = 1, 2) is the probabihty distribution of x in the state pi. 

More generally, in any physical theory, the specification of a state p (where here p is 
just a symbol to denote the state, but not necessarily a density matrix) fully specifies 
the probabilities of outcomes of all experiments that can be performed on the system. 
If we then have with probability pi a state pi which predicts for each observable x a 
probability distribution Pi{x) and with probability p2 a second state which predicts 
P2{x), then the probability distribution for any observable x given such mixture is of 
the form (15.31) . 

The concept of coherence can now be introduced. 

Definition 1: The state of a physical system displays coherence between 
two outcomes Xi and x^ of an observable x iff the state p of the system 
cannot be considered a statistical mixture of some underlying states pi 
and p2, where pi assigns probability zero for X2 and p2 assigns probability 
zero for Xi . 

This definition is independent of quantum mechanics. Within quantum mechanics it 
implies that the quantum density matrix representing the system cannot be decomposed 
in the form (15. 2p . Thus, for example, p = ■^{\ip+) {ip+l + lip-.) {■ip-.\) , where \ip±) = [|xi)± 
|x2)]/v^, does not display coherence between xi and X2 because it can be rewritten to 
satisfy (15. 2p . The definition will allow a state to be said to have coherence between Xi 
and X2 if and only if there is no possible ensemble decomposition of that state which 
allows an interpretation as a mixture (15.21) . so that the system cannot be regarded as 
being in one or other of the states that can generate at most one of Xi or X2. 

We next define the concept of generalised S-scopic coherence. 

Definition 2: We say that the state displays generalised S'-scopic co- 
herence iff there exist xi and X2 with X2 — xi > S (we take X2 > xi ), 
such that p displays coherence between outcomes x < Xi and x > X2. 
This coherence will be said to be macroscopic when S is macroscopic. 

If there is no generalised S'-scopic coherence, then the system can be described as a 
mixture (15.21) where now states pi and p2 assign nonzero probability only for x < X2 
and X > xi respectively. This situation is depicted in Fig. 15.11 
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Figure 5.1: Probability distribution for outcomes x of measurement x. If Xi and X2 
are macroscopically separated, then we might expect the system to be described as the 
mixture (15.21) . where pi encompasses outcomes x < X2, and p2 encompasses outcomes 
X > xi. This means an absence of generalised macroscopic coherence, as defined in 
Section [5^ 



An important clarification is needed at this point. It is clearly a vague matter to 
determine when S is macroscopic. What is important is that we are able to push the 
boundaries of experimental demonstrations of S-scopic coherence to larger values of 
S. We will keep the simpler terminology, but the reader might want to understand 
macroscopic as S-scopic throughout the text. 

Generalised macroscopic coherence amounts to a loss of what we will call a generalised 
macroscopic reality. The simpler form of macroscopic reality that inv olves only tw o 



states macroscopically distinct has been discussed extensively by Leggett |Leg84l . lLG85| . 
This simpler case would be applicable to the situation of Fig. 1 if there were zero prob- 
ability for result in the intermediate region Xi < x < X2. Macroscopic reality in this 
simpler situation means that the system must be in one or other of two macroscopically 
distinct states, pi and p2, that predict outcom es in re gions x < xi and x > X2, respec- 
tively. The term "macroscopic reality" is used jLG85| because the definition precludes 
that the system can be in a superposition of two macroscopically distinct states prior to 
measurement. Generalised macroscopic reality applies to the broader situation, where 
probabilities for outcomes Xi < x < X2 are not zero, and means that where we have 
two macroscopically separated outcomes Xi and X2, the system can be interpreted as 
being in one or other of two states pi and p2, that can predict at most one of xi or X2. 
Again, the term macroscopic reality is used, because this definition precludes that the 
system is a superposition of two macroscopically separated states that give outcomes 
xi and X2 respectively. 
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We note that Leggett and Garg |LG85| define a macroscopic reality in which they do 
not restrict to quantum states pi and p2, but allow for a more general class of theorie s 
where pi and p2 can be hidden variable states of the type considered by Bell |Bel64 |. 
Such states are not restricted by the uncertainty relation that would apply to each 
quantum state, and hence the assumption of macroscopic reality as applied to these 
theories would not lead to the inequalities we derive in this chapter. This point will be 
discussed in Section [5^ but the reader should note the definition of S-scopic coherence 
within quantum mechanics means that pi and p2 are quantum states. 



5.3 Generalised macroscopic and S'-scopic quantum 
Superpositions 

We now link the definition of generalised m acrosco pic coherence to the definition of 



generalised macroscopic superposition states |CR06l |. Generally we can express p as a 
mixture of pure states Thus 

p = ^Pi\ilji){xljil (5.4) 

i 

where we can expand each in terms of a basis set such as the eigenstates \x) of x: 
thus \ipi) = J2x^^\-^)^ being probability amplitudes. 

Theorem A: The existence of coherence between outcomes xi and X2 of an observable 
X is equivalent, within quantum mechanics, to the existence of a nonzero off-diagonal 
element in the density matrix, i.e, p \x2) 7^ 0. 

Proof: The proof is given in Appendix A. ■ 

Theorem B: In quantum mechanics, there exists coherence between outcomes xi and 
X2 of an observable x iff in any decomposition (IB.ll ) of the density matrix, there is a 
nonzero contribution from a superposition state of the type 

l^s) = Ca;i \Xi) + Ca;2 \X2) + ^ Oj; \x) (5.5) 

with Ca;,,Cx2 ^ 0. 

Proof: If each \ipi) cannot be written in the specific form (15. 5p . then each \ipi){ipi\ is 
either of form pi or p2, so that we can write p as the mixture (15. 2^ . Hence the existence 
of coherence, which implies p cannot be written as (15. 2p . implies the superposition 
must always exist in (IB.ll ). The converse is also true: if the superposition exists in any 
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decomposition, then there exists an irreducible term in the decomposition that assigns 
nonzero probabilities to both Xi and X2, and therefore the density matrix cannot be 
written as (15.21) . ■ 

We say that a generalised S-scopic superposition of states and \x2) exists when 
any decomposition (IB. II) must contain a nonzero probability for a superposition (15.51) . 
where xi and X2 are separated by at least S. Throughout this chapter, we define the 
size S of the generalised superposition 

|^) = ^Cfc|xfc) (5.6) 

k 

(where \xk) are eigenstates of x and each 7^ 0) to be the range of its prediction for 
X, this range being the maximum value of \xk — Xj\ where \xk) and \xj) are any two 
components of the superposition ( 15. 6p (so Cfc,Cj 7^ 0). 

From the above discussions it follows that within quantum mechanics, the existence 
of generalised S'-scopic coherence between Xi and X2 (here |x2 — Xil = S) implies the 
existence of a generalised S'-scopic superposition of type (15.51) . which can be written as 

= c^ip^ + CQipo + c+^p+, (5.7) 

where the quantum state ip- assigns some nonzero probability only to outcomes smaller 
than or equal to Xi, the quantum state ip+ assigns some nonzero probability only to 
outcomes larger than or equal to X2, and the state ipo assigns nonzero probabilities only 
to intermediate values satisfying Xi < x < X2. Where S is macroscopic, expression (I5.7P 
depicts a generalised macroscopic superposition state. In this case then, only the states 
ip- and ip+ are necessarily macroscopically distinct. We regain the traditional extreme 
macroscopic quantum state c_V'- + c+'0+ when Cq = 0. 



5.4 Minimum Uncertainty Theories 

We now follow a procedur e similar t o that used to derive criteria useful for the con- 



firmation of inseparability |DGCZ00l |. The underlying states pi and p2 comprising the 
mixture (15 ■2p are themselves quantum states, and so each will satisfy the quantum 
uncertainty relations with respect to complementary observables. This and the as- 
sumption of (15. 2p will imply a set of constraints. The violation of any one of these is 
enough to confirm the observation of a generalised macroscopic coherence- that is, of a 
generalised macroscopic superposition of type (15.71) . 

While our specific aim is to develop criteria for quantum macroscopic superpositions, 
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we present the derivations in as general a form as possible to make the point that ex- 
perimental violation of the inequalities would imply not only a generalised macroscopic 
coherence in quantum theory, but a failure of the assumption (15.31) in all theories which 
place the system in a probabilistic mixture of two states, which we designate by pi and 
P2, and for which the appropriate uncertainty rela tion ho lds for each of the states. In 
this sense, our approach is similar to that of Bell |Bel64| . except that the assumption 
used here of minimum uncertainties for outcomes of measurements would be regarded 
as more restrictive than the general hidden variable theories considered by Bell. 

We make this point more specific by defining a whole class of theories, which we refer to 
as the Minimum Uncertainty Theories (MUT), that embody the assumption that any 
state p within the theory will predict the same uncertainty relation for the variances 
of two incompatible observables x and p as is predicted by quantum mechanics. This 
is a priori not an unreasonable thing to postulate for a theory that may differ from 
quantum mechanics in the macroscopic regime but agree with all the observations in 
the well-studied microscopic regime. Here we will focus on pairs of observables, like 
position and momentum, for which the uncertainty bound is a real number, which with 
the use of scaling and choice of units will be set to 1, so we can write the an uncertainty 
relation assumed by all MUT's as 

A^xA^p > 1, (5.8) 

where A^x and A^p are the variances of x and p respectively. This is Heisenberg's un- 
certainty relation, and quantum mechanics is clearly a member of MUT. Other quantum 
uncertainty relations that will be specifically used in this chapter include 

A^x + A^p > 2, (5.9) 

which foll ows from (15. 8p and has been useful in derivation of inseparability criteria 

DGCzncj . 



5.5 Signatures for generalised S-scopic superpositions: 
Binned domain 

In this Section we will derive inequalities that follow if there are no s-scopic superpo- 
sitions (where s > S"), so that violation of these inequalities implies existence of an 
S'-scopic superposition (or coherence), as defined in Sections 15.21 and [531 The approach 
is similar to that often used to detect entangled states. Separability implies inequalities 
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Figure 5.2: Probability distribution for a measurement x . We bin results to give three 
distinct regions of outcome: 0, — 1,+1. 



such as those derived by Duan et al. and Simon |DGCZ00l . lSimOCl| . and their violation 
thus implies existence of entanglement. This app roach has been used to experimentally 
confirm entanglement, as described in reference BSLR03l |. among others. An experi- 



me ntal descri ption of the approach we use here has also been outlined by Marquardt et 
al. 



MALIOTJ. 



We consider two types of criteria for the detection of a gene ralised macroscopic super- 



3i, 



CRoai, will be considered 



position (or coherence). The first, of the type considered in 
in this section and uses binned outcomes to demonstrate a generalised S-scopic super- 
position of states and ^_ that predict outcomes in specified regions denoted +1 and 
— 1 respectively (Fig. 2), where these regions are separated by a minimum distance S. 
We expand on some earlier results of |CR06l | for completeness and also introduce new 
criteria of this type. 



5.5.1 Single system 

Consider a system A and a macroscopic measurement x on A, the outcomes of which 
are spread over a macroscopic range. We partition the domain of outcomes x for 
this measurement into three regions, labelled / = —1,0,1 for the regions x < —S/2, 
—S/2 < X < S/2, X > S'/2, respectively. The probabilities for outcomes to fall in those 
regions are denoted p_, po and p+, respectively (Fig. 2). 

If there is no generalised S-scopic coherence then there is no coherence between out- 
comes in / = 1 and / = — 1, and the state of system A can be written as 



Pmix = PlPL + PrPR, 



(5.10) 
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where pl predicts outcomes in the region x < S/2, pji predicts outcomes in the region 
X > —S/2, and pL and pR are their respective probabilities. The assumption of this 
mixture (IS.lOp implies 

P{y) = PLPL{y) + PRPB{y)- (5.11) 

Here y is the outcome of some measurement that can be performed on the system, and 
PR/hiy) is the probability for a result y when the system is specified as being in state 
Pr/l- Where the measurement performed is x, so ?/ = x, there is the constraint on 
(EH]) so that Pr{x) = for X < -S/2 and Pl{x) = for x > S/2. 

Now consider an observable p (with outcomes p) incompatible with x, such that the 
variances are constrained by the uncertainty relation A^xA^p > 1. Our goal is to 
derive inequalities from just two assumptions: firstly, that x and p are incompatible 
observables of quantum mechanics (or of a Minimum Uncertainty Theory) , so the uncer- 
tainty relation holds for both pr/l] and, secondly, that there is no generalised S-scopic 
coherence. 

Violation of these inequalities will imply that one of these assumptions is false. Within 
quantum mechanics, for which the first assumption is necessarily true, that would imply 
the existence of a generalised macroscopic superposition of type f l5.7p with outcomes xi 
and X2 separated by at least S. 

If the quantum state is of form (IS.lOp or if the theory satisfies (15.111) . then 

AV > PlA^p + prA>, (5.12) 

where A^j), A^^p and A^/jp are the variances of p in the states pmix^ Pl and pr, re- 
spectively. This follows simply from the fact the variance of a mixture cannot be 
less than the average variance of its component states. Specifically, if a probabil- 
ity distribution for a variable z is of the form P{z) = ^iliPi-P(^), then A'^z = 

We can now, using (15.121) and the Cauchy-Schwarz inequality, derive a bound for a 
particular function of variances that will apply if the system is describable as (I5.10p 

{plAIx + prAIx)a^p > ^A'p] 

i=L,R i=L,R 

> [ J2 P^^^X^iP? (5-13) 
i=L,R 

> 1. 

The left hand side is not directly measurable, since it involves variances of x in two 
states which have overlapping ranges of outcomes. We must derive an upper bound for 
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A^y^x in terms of measurable quantities. For this we partition the probability distri- 
bution Pr{x) according to the outcome domains / = 0, 1, into normalised probability 
distributions Pro{x) = Pr{x\x < S/2), and P+{x) = Pr{x\x > S/2): 

Pr{x) = ProPro{x) + Pr+P+{x). (5.14) 

Here pRj^ = ^^i^PR{x)dx = p+ and pRo = J^^"^ PR{x)dx. It follows that A|.a; = 
pRoA^f^x + pR+ Alx + pRopR+{fi+ - URof, where /i+(A^a;) and fiRo (A^qx) are the 
averages (variances) of P+{x) and Pro{x), respectively. Using the bounds pRo < 
Po/(po + P+), A%qX < 5*74, pR+ < 1 and < fi+ - fiRo < fi+ + S/2, we derive 

Alx < Alx + -^—[{S/2r + (/.+ + 8/2)'] (5.15) 
Po + P+ 

and, by similar reasoning, 

Alx < Alx + ^° [{S/2f + (//_ - S/2f]. (5.16) 
Po + P- 

Here /i± and A\x are the mean and variance of the measurable P±{x), which, since the 
only contributions to the regions + and - are from Pr{x) and Pl{x) respectively, are 
defined as the normalised + and — parts of P{x), so that P+{x) = P{x\x > S/2) and 
P_(x) = P{x\x < —S/2). We substitute ( 15.15^ in (15.131) . and use po + p+ > pR and 
po + p- > pL 'to derive the final result which is expressed in the following theorem. 

Theorem 1: The assumption of no generalised 5'-scopic coherence between outcomes in 
regions +1 and —1 of Fig. 2 (or, equivalently, of no generalised S'-scopic superpositions 
involving two states ^_ and ^+ predicting outcomes for x in the respective regions +1 
and —1) will imply the uncertainty relations 

iAl,x + poS)A'p>l (5.17) 

and 

Al^x + A'p>2-poS, (5.18) 

where we define Al^,^x = p+A^x + p-A'ix and 5 = + S/2)'' + - S/2)' + 
S'/2} + A^x + A^x. Thus, the violation of either one of these inequalities implies 
the existence of a generalised S'-scopic quantum superposition, and in this case the 
superposition involves states and ip^ predicting outcomes for x in regions +1 and 
— 1, of Fig. 2, respectively.). 

As illustrated in Fig. 2, the A^x and fi± are the variance and mean of P±{x), the 
normalised distribution over the domain / = ±1. p± is the total probability for a 
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result X in the domain / = ±1, while po = 1 ~ (p+ + P-)- The measurement of 
the probability distributions for x and p are all that is required to determine whether 
violation of the inequality (I5.17p or (IS.lSp occurs. Where x and p correspond to optical 
fiel d quadra tures, such distributions have been measured, for example, by Smithey et 
al 



SBRF93|. 



Proof: The assumption of no such generalised S'-scopic superposition implies (15.101) . 
We have proved that (|5.17l) follows. To prove (j5.18l) . we start from (IS.lOp and the 
uncertainty relation (15.91) . and derive a bound that will apply if the system is de- 
scribable as ^M): (pl^Ix + PrA^x) + A^p > [£.^^^^p,A^x] + [Ei=L,Rp^^b] > 
Ei=L,R PilA^x + Ajp] > 2. Using (|5.15|), (!5.16p and po + p+ > Pr and po + p- > Pl 
we get the final result. ■ 



5.5.2 Bipartite systems 

One can derive similar criteria where we have a system comprised of two subsystems A 
and B. In this reduced variance may be found in a combination of observables 

from both subsystems. A common example is where there is a correlation between the 
two positions and of subsystems A and B respectively, and also between the 
two m omenta and P^. Such correlation was discussed by Einstein, Podolsky and 
Rosen |EPR35l | and is called EPR correlation. If a sufficiently strong correlation exists, 
it is possible that both the position difference X^ — X^ and the momenta sum P^ + P^ 
will have zero variance. 

Where we have two subsystems that may demonstrate EPR correlation, we may con- 
struct a number of useful complementary measurements that may reveal generalised 
macroscopic superpositions. The simplest situation is where we again consider super- 
positions with respect to the observable X^ of system A. Complementary observables 
include observables of the type 

P = P^-gP^, (5.19) 

where g is an arbitrary constant and P^ is an observable of system B. We denote the 
outcomes of measurements X^, P^, P^, P by the lower case symbols x^, p^, p^, p 
respectively. The Heisenberg uncertainty relation is 

A^x^Alj^^p"" = A^x^A^p > 1. (5.20) 

We have introduced A^^^^p^ = A'^p so that a connection is ma de with notatio n used 
previously in the context of demonstration of the EPR paradox jReisd . RDBl|. More 
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generally |Rei03l . IRDB 



we define an inference variance 

^?n// = E^(^'')^'(^^|P'')' (5-21) 



which is the average conditional variance for P"^ at A given a measurement of at B. 
The A'^{p^\p^) are the variances of the conditional probability distributions P{p^\p^). 
We note that Aj^j^^p^ is the linear regression est imate o f A^^^p"^, but that we have 



^infP — ^inf,LP fo'^ the case of Gaussian states [RDBlfl. The uncertainty relation 



A^^Aiy > 1 (5.22) 



and also A^p^Af^^x^ > 1, holds true for all quantum states |CR07l |. so that we can 
interchange Af^^p^ with A^^^ jj)^ in the proofs and theorems below. 

Theorem 2: Where we have a system comprised of subsystems A and i?, the absence 
of generalised S'-scopic superpositions with respect to the measurement implies 

(ALx^ + Po5)A^„y >1. (5.23) 

A^^gX^, po and 5 are defined as for Theorem 1 for the distribution P{x^). A^^jp^ is 
defined by (15.211) and involves measurements performed on both systems A and B. The 
inequality (15.231) also holds replacing A^^^p^ with A^^j jj)^ which is defined by (15.201) . 
Thus violation of (|5.23l) implies the existence of the generalised S-scopic superposition, 
involving states predicting outcomes for in regions +1 and —1. 

Proof: The proof follows in identical fashion to that of Theorem 1, except in this 
case the pL and pr of ( I5.10p are states of the composite system, and there is no con- 
straint on these except that the domain for outcomes of X^ is restricted as specified 
in the definition of Pr/l- The expansion (IB. Ill for the density matrix as a mixture is 



p = J2r Prli^r) {i^r\ whcrc uow ipr = J2i j Cij\xi) A\xj) B, B bciug eigcustates of an ob- 
servable of system B that form a basis set for states of -B. The generalised superposition 
(15.51) thus becomes in this bipartite case 

\ll)r) = Ci \ui)b + C2 \x2)j^ \U2) B + ^ Cij \Xi) J^ \Xj)B, (5.24) 

where and \u2) are pure states for system B. If we assume no generalised S'-scopic 
superposition, then p can be written without contribution from a state of form ( I5.24p 
and we can write p as (15.10p . The constraint (15.10p implies P{p) = J2i=r l Pi^iiP) 
where Pr\i{j>) is the probability distribution of p for state Pr/l- Thus (15.121) also holds 
for p replacing p, as do all the results ( l5.14p -( l5TT6l) involving the variances of x. Also, 
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(I5.12P holds for Aj^jp"^ (see Appendix B). Thus we prove Theorem 2 by following 
fl5T2|l - fl5T7D . ■ 

In order to violate the inequality f l5.23p . we would look to minimise Aj^jp^^ or Af„j^p^ = 
A^p. For the optimal EPR states, + has zero variance, and one would choose 
for p the case of (7 = —1, so that p = p^ + p^ , where p^ is the result of measurement 
of P^ at B. This case gives Af^jp"^ = 0. More generally for quantum states that are 
not the ideal case of EPR, our choice of p becomes so as to optimise the violation of 
(15 .23^ and will depend on the quantum state considered. This will be explained further 
in Section [5^81 

A second approach is to use as the macroscopic measurement a linear combination of 
observables from both systems A and B, so for example we might have x = (X^ + 
X^)/^/2 and p = (P^ + P^)/V2. Relevant uncertainty relations include (based on 
\[X^,P^]\ = 2 which gives AxMp^ > 1) 

A(x^ + x^)A(/+p^) > 2 (5.25) 

and 

A2(x^ + x^)+A2(/+p^) >4. (5.26) 

and from these we can derive criteria for generalised S-scopic coherence and superposi- 
tions. 

Theorem 3: The following inequalities if violated will imply existence of generalised 
S-scopic superpositions. 

A I R A I R 

2 ^X + X^ ^ ^\/^-?rP^P 



K.ei^^) + Po5)A\^--^) > 1 (5.27) 

A , B A , B 

AL(^-^) + ^"i-^) > 2 - PoS- (5.28) 

We write in terms of the normalised quadratures so that, following (15.250 . A^( '^'^^^ ) < 
1 would imply squeezing of the variance below the quantum noise level. The quantities 
A^^gX, pQ and 6 are defined as for Theorem 1, but we note that P{x) in this case is the 
distribution for x = (X^ + X^)/y/2. S now refers to the size of the superposition of 
{X^ + X^)/V2. 

Proof: In this case the pji/^ of f l5.10p are defined as specified originally in (I5.10p 
but where x is now defined as the outcome of the measurement x = (X^ + X^)/^/2. 
The failure of the form fl5.10p for p is equivalent to the existence of a generalised 
superposition of type (15.241) where now \xi) refers to eigenstates of X^ + X^. Thus the 



5.6 Signatures of non-locatable generalised S-scopic superpositions 



81 



eigenstates are of the general form \xi) = Cj\xj)A\xi — Xj)B- The mixture (IS.lOp 
implies (j5.12l) where now p refers to the outcome of p = (P^^ + P^)v^, and will imply 
a similar inequality for x. Application of uncertainty relation (I5.25p for the products 
can be used in (I5.13p . and the proof of (15.270 follows as in (15. 120 - 115. 171) of Theorem 1. 
The second result follows by applying the procedure for proof of (|5.18p but using the 
sum uncertainty relation (15.26^ . ■ 

5.6 Signatures of non-locatable generalised S-scopic 
superpositions 

A second set of criteria will be developed to demonstrate that a generalised S-scopic 
superposition exists, so that two states comprising the superposition predict respective 
outcomes separated by at least size S, but in this case there is the disadvantage that 
no information is obtained regarding the regions in which these outcomes lie. 

This lack of information is compensated by a far simpler form of the inequalities and 
increased sensitivity of the criteria. For pure states, a measurement of squeezing Ap 
implies a state that when written in terms of the eigenstates of x is a superposition 
such that Ax > 1/Ap. With increasing squeezing, the extent S of the superposition 
increases. To develop a simple relationship between S and Ap for mixtures, we assume 
that there is no such generalised coherence between any outcomes of x separated by a 
distance larger than 5". This approach gives a simple connection between the minimum 
size of a superposition describing the system and the degree of squeezing that is mea- 
sured for this system. The drawback is the loss of direct information about the location 
(in phase space for example) of the superposition. We thus refer to these superpositions 
as "non-locatable". 

5.6.1 Single systems 

We consider the outcome domain of a macroscopic observable x as illustrated in Fig. 
3, and address the question of whether this distribution could be predicted from micro- 
scopic, or s-scopic {s < S), superpositions of eigenstates of x alone. 

The assumption of no generalised S-scopic coherence (between any two outcomes of the 
domain for x) or, equivalently, the assumption of no generalised S'-scopic superpositions, 
with respect to eigenstates of x, means that the state can be written in the form 



i 



(5.29) 



82 



Generalised Macroscopic Superpositions 




Figure 5.3: We consider an arbitrary probability distribution for a measurement x that 
gives a macroscopic range of outcomes. 



Here each psi is the density operator for a pure quantum state that is not such a 
generalised S'-scopic superposition, so that psi has a range of possible outcomes for x 
separated by less than S. Hence psi = \4'si){i^Si\ where 

li'si) = '^Ck\xk) (5.30) 

k 

but the maximum separation of any two states \xk),\xk') involved in the superposition 
(that is with c^, c^/ 7^ ) is less than S, so \xk — Xk'\ < S. 



Assumption (15.291) will imply a constraint on the measurable statistics, namely that 
there is a minimum level of uncertainty in the prediction for the complementary ob- 
servable p. The variances of each psi must be bounded by 

^^SiX < (5-31) 

It is also true that 

A^p > J2 pA^siP- (5.32) 

i 

Now the Heisenberg uncertainty relation applies to each psi (the inequality also applies 
to the MUT's discussed in Section EHj) so for the incompatible observables x and p 



(5.33) 
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Thus a lower bound on the variance of p follows. 



A^p > Y^Pi^^siP (5.34) 

i 

We thus arrive at the following theorem. 

Theorem 4: The assumption of no generalised 5'-scopic coherence in x will imply the 
following inequality for the variance of outcomes of the complementary observable p: 

Ap > |. (5.35) 

The main result of this section follows from Theorem 4 and is that the observation of 
a squeezing Ap in p such that 

Ap < 2/S (5.36) 
will imply the existence of an S'-scopic superposition 

Cx\x) + c^+s\x + S) + (5.37) 

namely, of a superposition of eigenstates \x) of x, that give predictions for x with a range 
of at least 5*. The parameter S gives a minimum extent of quantum indeterminacy with 
respect to the observable x. Here and c^+s represent non-zero probability amplitudes. 

In fact, using our criterion (I5.36P squeezing in p {Ap < 1) will rule out any expansion of 
the system density operator in terms of superpositions of \x) with S <2 (Fig. 4). Thus 
onset of squeezing is evidence of the onset of quantum superpositions of size S > 2, the 
size S = 2 corresponding to the vacuum noise level. This noise level may be taken as a 
level of refer ence in de termining the relative size of the superposition. The experimental 
observation 
least 5 = 5. 



SYK+06I I of squeezing levels of Ap ^0.4 confirms superpositions of size at 



5.6.2 Bipartite systems 

For composite systems comprised of two subsystems A and B upon which measurements 
X^, P^, , can be performed, the approach of the previous section leads to the 
following theorem. 

Theorem 5a. The assumption of no generalised S'-scopic coherence with respect to 
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implies 

Am// > |- (5.38) 

^InfP^ is defined as in (I5.21I) . The result also holds on replacing A'^^jp^ with A^^^^p^ 
as defined in (|5.20l) . 

Theorem 5b. The assumption of no generalised S-scopic coherence with respect to 
x = {X^ + X^)/V2 implies 

A(^^)>| (.30) 

Proof: The proof follows as for Theorem 4, but using the uncertainty relations (I5.20p 
and ([5:2511 in ([OD instead of flOal) . ■ 

The observation of squeezing such that (|5.38p is violated, i.e 

AinfP^ < 2/S (5.40) 

will imply the existence of an S'-scopic superposition 

Cx\x)a\ui)b + Cx+s\x + S)a\u2)b + (5.41) 

namely, of a superposition of eigenstates \x)a that give predictions for X^ separated by 
at least S. Similarly, the observation of two-mode squeezing such that fl5.39p is violated, 

A,B 

A(^-^) < 2/S, (5.42) 

will imply existence of an S'-scopic superposition of eigenstates of the normalised posi- 
tion sum {X^ + X^)/V2. 



5.7 Criteria for generalised -S'-scopic coherent state su- 
perpositions 

The criteria developed in the previous section may be used to rule out that a system 
is describable as a mixture of coherent states, or certain superpositions of them. If a 
system can be represented as a mixture of coherent states |a) the density operator for 
the quantum state will be expressible as 

p = J P{a)\a){a\(fa (5.43) 
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Figure 5.4: P(x) for a coherent state Ax = Ap = 1. 



which is, since -P(«) is positive for a mixture, the Glauber-Sudarshan P-representation 



Gla63 



Sud63l |. The quadratures x and p are defined as x = a + and p = {a — a^)/i^ 
so that Ax = Ap = 1 for this minimum uncertainty state, where here a, a) are the 
standard boson creation and annihilation operators, so that a\a) = a\a). Proving 
failure of mixtures of these coherent states would be a first requirement in a search for 
macroscopic superpositions, since such mixtures expand the system density operator in 
terms of states with equal yet minimum uncertainty in each of x and p, that therefore 
do not allow significant macroscopic superpositions in either. 

The coherent states form a basis for the Hilbert space of such bosonic fields, and any 
quantum density operator can th us be e xpanded as a mixture of coherent states or 
their superpositions. It is known jwal83| that systems exhibiting squeezing {Ap < 1) 
cannot be represented by a positive Glauber-Sudarshan representation, and hence onset 
of squeezing implies the existence of some superposition of coherent states. A next step 
is to rule out mixtures of Sa-scopic superpositions of coherent states . To define what 
we mean by this, we consider superpositions 

\i^sJ = J2^^\^^) (5-44) 

i 

where for any \ai), \aj) such that Ci,Cj ^ 0, we have \ai — aj\ < Sa for all z, j {sa 
is a positive number). We note that for a coherent state (x) = 2a. Thus the 
separation of the states with respect to x is defined as Sa = 2sa- The "separation" of 
the two coherent states | — a) and |a) (where a is real) in terms of x corresponds to 
Sa = 4:a = 2sa, as illustrated in Fig. 15.51 



We next ask whether the density operator for the system can be described in terms of 
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Figure 5.5: (a) P(x) for a superposition of coherent states (l/v^){e*'^/''| 
(here the scale is such that Ax = 1 for the coherent state 



-a) + e-'^l^\a)} 



the Sa-scopic coherent superpositions, so that 



(5.45) 



where each is of the form fl5.44p . Each predicts a variance in x which has 

an upper limit given by that of the superposition (l/-\/2){e*'^/^| —Sa/2) + e~^'^^^\sa/2)}. 
This state predicts a probability distribution P(x) = | J2± Pg±{x) where 



Pg±{x) 



[X 



exp[ 



(5.46) 



(Fig. 15. 5p . which corresponds to a variance A^x = (x^) = 1 + = 1 + 5*^/4. This 
means each I'lplJ is constrained to allow only A^x < 1 + which implies for each \ips,r) 
a lower bound on the variance A'^p so that A^p > 1/A^x > 1/(1 + s^). Thus using the 



result for a mixture (15.45p . we get that if indeed (15.451) can describe the system, the 
variance in p is constrained to satisfy A'^p > 1/(1 + s^). 

Thus observation of squeezing A^p < 1, so that the inequality 



A^p < 1/(1 + s. 



(5.47) 



is violated, will allow deduction of superpositions of coherent states with separation at 
least Sa- This separation corresponds to a separation of Sa = in x between the 
two corresponding Gaussian distributions (Fig. 15.51) . on the scale where A^x = 1 is the 
variance predicted by each coherent state. 



We note that measured values of squeezing Ap ^ 0.4 ISYK+06I I would imply Sa ^ 2.2 
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This confirms the existence of a superposition of type 

li^s) = ^Ci\ai) = c_| - ao) + ... + c+\ao) (5.48) 

i 

where a separation of at least Sa = \ai — C(j\ = 2.2 occurs between two coherent states 
comprising the superposition, so that we may write ao = 1.1. Note we have defined 
reference axes in phase space selected so that the x-axis is the line connecting the 
two most separated states |aj)and |a-,)so that \ai — aj\ = 2ao and t he p-axis c uts 
bisects this line. The ( 15.48^ can be compared with experimental reports |QTBLG06l | of 
generation of extreme coherent superpositions of type (l/A/2){e*'^/^| — ao) + e~*'^'''^|ao)} 
where |aoP = 0.79, implying ao = 0.89. The corresponding generalised Sq— scopic 
superposition (15.48p as confirmed by the squeezing measurement involves at least the 
two extreme states with |aoP = 1.2, but could include other coherent states with 
|ao| < 1.1. 



5.8 Predictions of particular quantum states 

We will now consider experimental tests of the inequalities derived above. An important 
point is that the criteria presented are sufficient to prove the existence of generalised 
macroscopic superpositions, but there are many macroscopic superpositions which do 
not satisfy the above criteria. Nevertheless there are some systems of current experi- 
mental interest which do allow for violation of the inequalities. We analyse such cases 
below, noting that the violation would be predicted without the experimenter needing 
to make assumptions about the particular state involved. 



5.8.1 Coherent states 

The wave function for the coherent state |a) is 

1 -x^ 

{x\a) = ^exp{— h ax — |a|^}. (5.49) 

(27r)4 4 

This gives the expansion in the continuous basis set |x), the eigenstates of x. Thus for 
the coherent state 

\a) = ^c.\.) = l{.\a)\.)d.. (5.50) 
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Figure 5.6: Plot of {x\p\x') for a coherent state 



The probability distribution for x is the Gaussian (Fig. 15. 4p 

P{x) = \{x\a)\' = -Vexp{ ~^^~^^^' }, (5.51) 
(we take a to be real) centred at 2a and with variance A^x = 1 

The coherent state possesses nonzero off-diagonal elements {x\p\x') where |x— x'| is large 
and thus strictly speaking can be regarded as a generalised macroscopic superposition. 
However, as x and x' deviate from 2a, the matrix elements decay rapidly, and the 
off-diagonal elements decay rapidly with increasing separation. 

a; p a;' = r exp{^- + ^ — '-} 5.52 

(27r)5 ^4 4 ^ ^ ' 

In effect then, the off-diagonal elements become zero for significant separations \x — 
x'\>l (Fig j5.6l) . We can expect that the detection of the macroscopic aspects of this 
superposition will be difficult. Since Ap = 1, it follows that we can use the criterion 
(I5.35P to prove coherence between outcomes of x separated by at most S = 2 (Fig. 15.41) . 
which corresponds to the separation 5* = 2Ax. 



5.8.2 Superpositions of coherent states 



The superposition of two coherent states |WM85l . IYS86 | 



;i/v^){^ 



,i7r/4| 



(5.53) 
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Figure 5.7: Plot of {x\p\x') for the superposition state (15.530 . 

where a is real and large is an example of a macroscopic superposition state. The wave 
function in the position basis is 

We consider the two complementary observables x and p, and note that the probability 
distribution P{x) for x displays two Gaussian peaks centred on a; = ±2a (Fig. 15.51) : 
P{x) = ^J2± Pg±{x) where PG±ix) = exp[— (x — ±2a)^/2]/-\/27r. Each Gaussian has 
variance A'^x = 1. 

The macroscopic nature of the superposition is reflected in the significant magnitude 
of the off-diagonal elements {x\p\x') where x = ±2a and x' = =F2tt, corresponding to 
|x — x'l = 4a. In fact 

e 4 ^"o , 

|(a;|p|x')| = -1= Y cosh(2ax) cosh(2ax'), (5.54) 

V 27r 

as is plotted in Fig. 15.71 and which for these values of x and x becomes r^. With 

significant off-diagonal elements connecting macroscopically different values of x, this 
superposition is a good example of a generalised macroscopic superposition (15.71) . 

Nonetheless we show that the simple linear criteria (15.35^ and (15.17^ derived from ( IB.ljl 
are not sufficiently sensitive to detect the extent of the macroscopic coherence of this 
superposition state (15.531) . even though the state (15.531) cannot be written in the form 
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Figure 5.8: (a) P(p) for a superposition (15.531) of two coherent states where a = 2.5 
and (b) the reduced variance A'^p < 1, versus a. 

(15.1 op . We point out that it may be possible to derive further nonlinear constraints 
from (15.1 op to arrive at more sensitive criteria. 

To investigate what can be inferred from criteria (I5.35p . we note that x is the macro- 
scopic observable. The complementary observable p has distribution P{p) = exp [— p^/2](l+ 
sin 2ap) / VQjt which exhibits fringes and has variance A^p = 1 — exp [— 4q;^] (Fig. 
15.81) . There is a maximum squeezing of A'^p ^ 0.63 at a = 0.5. However, the squeezing 
diminishes as a increases, so the criterion becomes less effective as the separation of 
states of the macroscopic superposition increases. The maximum separation S that 
could be conclusively inferred from this criterion is 5 ~ 2.5 at a = 0.5. 

As discussed in Section 15.71 the detection of squeezing in p is enough to confirm the 
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system is not that of the mixture 

p= l/2{\a){a\ + \- a){-a\) (5.55) 

of the two coherent states. In fact, the squeezing rules out that the system is any 
mixture of coherent states. We note though that since the degree of squeezing Ap is 
small, our criteria is not sensitive enough to rule out superpositions of macroscopically 
separated coherent states. 

5.8.3 Squeezed states 



Consider the single-mode momentum squeezed state|Yue76| 



,r(a2-at2) 



|0). (5.56) 



Here |0) is the vacuum state. For large values of r these states are generalised macro- 
scopic superpositions of the continuous set of eigenstates |x) of a; = a + a^, with wave 
function ^ 

(^l^) = 7r^exp{^}, (5.57) 

and associated Gaussian probability distribution 

n^) = --^exp{^}. (5.58) 

The variance is a = e^^. As the squeeze parameter r increases, the probability distribu- 
tion expands, so that eventually with large enough r, x can be regarded as a macroscopic 
observable. This behaviour is shown in Fig. 15.91 The distribution for p is also Gaussian 
but is squeezed, meaning that it has reduced variance: A^p < 1. In fact, the f l5.56p is 
a minimum uncertainty state, with A^p = 1/a = e"^*". Where squeezing is significant, 
the off-diagonal elements {x\p\x') = {x\ip) {ip\x') (where \x — x'\ is large) are significant 
over a large range of x values (Fig. 15.91) . 

The criterion (15.171) for the binned outcomes is violated for the ideal squeezed state 
f l5.56p for values of S up to O.Si/a. The criterion can thus confirm macroscopic super- 
positions of states with separation of up to half the standard deviation of the probability 



distribution of x, even as Ax oo. This behaviour has been reported in |CR06l | and 
is shown in Fig. 15.101 

Squeezed systems that are generated experimentally will not be describable as the pure 
squeezed state f l5.56p . This pure state is a minimum uncertainty state with AxAp = 1. 
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Figure 5.9: (a) Probability distribution for a measurement X for a momentum-squeezed 
state. The variance A^x increases with squeezing in p, to give a macroscopic range of 
outcomes, and for the minimum uncertainty state (15.561) satisfies AxAp = 1. (b) 
The {x\p\x') for a squeezed state fl5.56p with r = 13.4 (Ax = 3.67) which predicts 
(ata) = 2.52. 
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Figure 5.10: Detection of underlying superpositions of size S for the squeezed minimum 
uncertainty state (I5.56p by violation of ( 15.170 (dashed line of (b)) and (15.35p (full line 
of (b)). Smax is the maximum S for which the inequalities are violated. Inset of (b) 
shows behaviour of violation of (15.170 for general Gaussian-squeezed states. Inequality 
(I5.35P depends only on Ap. The size of Smax relative to P{x) is illustrated in (a). 
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Typically experimental data will generate Gaussian probability distributions for both 
X and p and with squeezing Ap < 1 in but typically AxAp > 1. The maximum 
value of S that can be proved in this case of the Gaussian states reduces to as AxAp 
(or AxAinfp) increases to ~ 1.6. This is shown in Fig. 15.101 Analysis of recent 
experimental data f or impure states that allows a violation of (j5.17p has been reported 
by Marquardt et al |MAL+07| . 

The criterion f l5.35p . as given by Theorem 4, is better able to detect the superpositions 
(Fig. 15.101) . particularly where the uncertainty product gives AxAp > 1, though in this 
case the superpositions are non-locatable in phase space, so that we cannot conclude 
an outcome domain for the states involved in the superposition. This criterion depends 
only on the squeezing Ap in the one quadrature and is not sensitive to the product 
AxAp. For ideal squeezed states with variance A'^x = a, one can prove a superposition 
of size S = 2y/a, four times that obtained from (I5.17P (Fig. 15.101) . 



Experimental reports |SYK"'"06l | of squeezing of orders Ap ^ 0.4 confirms superpositions 
of size at least 5 = 5, which is 2.5 times that defined by S* = 2, which corresponds to 
two standard deviations of the coherent state, for which Ax = 1 (Fig. 15. 4p . 



5.8.4 Two-mode squeezed states 



Next we consider the two- mode squeezed state |CS85 | 



gr-(afe-atfet)|Q^|Q^^ (5.59) 

Here a, b are boson annihilation operators for modes A and B respectively. The wave 
function {x\ip) and distribution P{x) are as in (|5.57l) and (|5.58p . but the variance in 
X = is now given by cr = cosh2r. The x = is thus a macroscopic observable. 

In the two-mode case, the squeezing is in a linear combination P^ + P^ of the momenta 
P^ and P^ at A and B, rather than in the momentum p = P"^ for A itself. The 
observable that is complementary to X"^ is of form P = P"^ — gP^ where g is a 
constant, which is (15.191) of Section 15.51 We can select to evaluate one of the criteria 
((5231), (lOSl ) or (ICTl) . 

Choosing as our macroscopic observable and our complementary one P"^ — gP^ , 
we calculate 

A^^^p^ = l/(r = l/cosh2r (5.60) 



for the choice g = (P^P^) / ({P^)"^) = -tanhr which minimises Af^jp-^ (Rei89|. The 
application of results to criterion (15.231) gives the result as in Fig. 15.101 to indicate 
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detection of superpositions of size S where S = 0.5^/o' for the ideal squeezed state 
fl5.59p . and the result shown in the inset of Fig. 15.91 if Ax^Aj„jp^ > 1. 

The prediction for the criterion of Theorem 3, to detect superpositions in the position 
sum + by measurement of a narrowed variance in the momenta sum + P^, 
is also given by the results of Fig. 15.101 Calculation for the ideal state (|5.59l) predicts 

= e"*"^^ which corresponds to that of the single-mode 



A 



-2r 



and A 



squeezed state. The prediction for the maximum value of S of Theorem 3 is therefore 
given by the dashed curves of Fig. 15.101 and the inset. 



A better result is given by (15.381) . if we are not concerned with the location of the 
superposition. Where we use (15.381) . the degree of reduction in Aj^jp"^ determines the 
size of superposition S that may be inferred. By Theorem 5, measurement of Ainjp^ 
allows inference of superpositions of eigenstates of X^ separated by at least 



S = 2/A,„y 



(5.61) 



Realistic states are not likely to be pure squeezed states as given by (|5.59l) . Nonetheless 
the degree of squeezing indicates a size of superposition in X^ , as given by Theorem 
5. Experimental values of A'^^jp^ ^ 0.76 have been reported [BSLR03|, to give con- 
firmation of superpositions of size 5* ~ 2.3, which is 1.1 times the level of S' = 2 that 
corresponds to two standard deviations Ax^ = 1 of the vacuum state (Fig. 15.41) . 

More frequently, it is the practice to measure squeezing in the direct sum + P^ 
of momenta. The macroscopic observable is the po sition sum X^ + X^. The reports 
of measured experimental values indicate LCK+05 1 A^( ^ ^ ) ^ 0.4, which according 
to Theorem 5 implies superpositions in {X^ + X^)/\^ of size S ~ 3.2, of order 1.6 
times the standard vacuum state level . The slightly better experimental result for 
the superpositi ons in th e position sum may be understood since it has been shown by 



Bowen et al 

.2 



BSLR03l | that, for the Gaussian squeezed states, the measurement of 



'^infP is more sensitive to loss than that of A (p +p ). The Ai„ fP is an asymmetric 
measure that enables de monstr a tion of t he EPR paradox |Rei89l . lRei03l |. a strong form 



of quantum nonlocality 



RDB- 



WJDO 



5.9 Concluding remarks 

In this chapter we have derived criteria sufficient to detect generalised macroscopic 
(or S-scopic) superpositions (^fc^Cfc|xfc)) of eigenstates of an observable x. For these 
superpositions, the important quantity is the value S of the extent of the superposition, 
which is the range in prediction of the observable {S is the maximum of \xj — Xi\ where 
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Cj,Ci 7^ 0). This quantity gives the extent of indeterminacy in the quantum prediction 
for X. In this sense, there is a contrast with the prototype macroscopic s uperpo sition ( 



of type C2IX2) + ci|xi)) that relates directly to the essay of Schrodinger |Sch35l |. Such 
a prototype superposition contains only the two states that have separation S in their 
outcomes for x. Nonetheless, we have discussed how the generalised superposition is 
relevant to testing the ideas of Schrodinger, in that such macroscopic superpositions 
are shown to be inconsistent with the hypothesis of a quantum system being in at most 
one of two macroscopically separated states. 

We have also defined the concept of a generalised S'-scopic coherence and the class of 
Minimum Uncertainty Theories without direct reference to quantum mechanics. The 
former is introduced in Section 15.21 as the assumption (15.31) and is associated to the 
failure of a generalised assumption of macroscopic reality. This assumption is that the 
system is in at most one of two macroscopically distinguishable states, but that these 
underlying states are not specified to be quantum states. The assumption of Minimum 
Uncertainty Theories is that these component states do at least satisfy the quantum 
uncertainty relations. In the derivation of the criteria of this chapter, only two assump- 
tions are made: that the system does satisfy this generalised macroscopic (S'-scopic) 
reality and that the theory is a Minimum Uncertainty Theory. These assumptions 
lead to inequalities, which, when violated, generate evidence that at least one of the 
assumptions must be incorrect. 

We point out that if, in the event of violation of the inequalities, we opt to conclude 
the failure of the Minimum Uncertainty Theory assumption, then this does not imply 
quantum mechanics to be incorrect, but rather that it is incomplete, in the sense that 
the component states can themselves not be quantum states. It can be said then that 
violation of the inequalities of this chapter implies at least one of the assumptions of 
generalised macroscopic (S-scopic) reality and the completeness of quantum mechanics 
is incorrect. 



There is a similarity with the Einstein-Podolsky- Rosen argument |EPR35l |. In the 
EPR argument, the assumption of a form of realism (local realism) is shown to be 
inconsistent with the completeness of quantum mechanics. Therefore, as a conclusion 
of that argument, one is left to conclude that at least one of local realism and the 



R,ein3 



Wisn6l . 1rDB1|. EPR opted for the first and 



completeness of QM is incorrect 
took their argument as a demonstration that quantum mechanics was incomplete. Only 
after Bell |Bel64| was it shown that this was an incorrect choice. Here, as in the EPR 
argument, the assumption of a form of realism (macroscopic (S'-scopic) realism) can 
only be made consistent with the predictions of quantum mechanics if one allows a 
kind of theory in which the underlying states are not restricted by the uncertainty 
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Chapter 6 



Conclusion 



As mentioned in the Introduction, the objective of this thesis was to contribute to the 
area or research termed experimental metaphysics. Our modest contribution was in 
formalising old concepts, proposing new ones, and finding new results in well-studied 
areas. We have also proposed experiments to test each of the major results. It is 
experimental metaphysics, after all. 

In Chapter [2] we set up the appropriate terminology and the basic concepts. Most of 
it were simply careful definitions of standard concepts, but some definitions were new 
and some results and consequences may not have been fully appreciated before. 

n Chap ter [3] we analysed the original argument of Einstein, Podolsky and Rosen (EPR) 



EPR35|, and proposed a general mathematical form for the assumptions behind that 



argument, namely those of local causality and completeness of quantum the ory. Tha t 
entailed what was termed a Local Hidden State model by Wiseman et al. fWJDOi] . 
which was p roposed as a formalisation of the concept of steering first introduced by 
Schrodinger |Sch35l | in a reply to the EPR paper. Violation of any consequences that 
can be derived from the assumption of that model therefore implies a demon stratio n of 
the EPR paradox. We have re-derived the well-known EPR-Reid criterion |Rei89l | for 
continuous-variables correlat ions, and derived new ones applicable to the spin setting 
considered by Bohm Boh5ll |. 

The spin set-up of the EPR-Bohm paradox was used by Bell |Bel64| to derive his now 
famous theorem demonstrating the incompatibility of the assumption of local causality 
and the predictions of quantum mechanics. The inequalities which bear his name can 
be derived for any number of discrete outcomes, but so far there has been no derivation 
which can be directly applied to the continuous-variables case of the original EPR 
paradox. In Chapter [H we closed the circle by deriving a class of inequalities which make 
no explicit mention about the number of outcomes of the experiments involved, and 
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can therefore be used in continuous-variables measurements with no need for binning 
the continuous results into discrete ones. Apart from that intrinsic interest, these 
inequalities could prove important as a means to perform an unambiguous test of Bell 
inequalities which does not suffer from the logical loopholes that plague all experimental 
demonstrations so far, since optical homodyne detection can be performed with high 
detection efficiency. The technique, which is based on a simple variance inequality, was 
also used to re-derive a large class of well-known Bell-type inequalities and at the same 
time find their quantum bound, making explicit from a formal point of view that the 
non-commutativity of the local operators is at the heart of the quantum violations. 

Finally, in Chapter [5] we addressed the issue of macros copic s uperpositions originally 
sparked by the infamous "cat paradox" of Schrodinger [Sch35||, presented in the same 
seminal paper where he coined the terms entanglement and steering. We considered 
macroscopic, mesoscopic and 'S'-scopic' quantum superpositions of eigenstates of an 
observable, and developed some signatures for their existence. We defined the extent, 
or size 5" of a (pure-state) superposition, with respect to an observable X, as being 
the maximum difference in the outcomes of X predicted by that superposition. Such 
superpositions were referred to as generalised S'-scopic superpositions to distinguish 
them from the extreme superpositions that superpose only the two states that have a 
difference S in their prediction for the observable. We also considered generalised S- 
scopic superpositions of coherent states. We explored the constraints that are placed on 
the statistics if we suppose a system to be described by mixtures of superpositions that 
are restricted in size. In this way we arrived at experimental criteria that are sufficient to 
deduce the existence of a generalised ^-scopic superposition. The signatures developed 
are useful where one is able to demonstrate a degree of squeezing. 
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Appendix A 

Proof of Theorem A of Chapter 



5 



We will now prove the statement that coherence between Xi and X2 is equivalent to a 
nonzero off-diagonal element {xi\p\x2) in the density matrix. As discussed in section [5^ 
within quantum mechanics the statement that there exists coherence between xi and 
X2 is equivalent to the statement that there is no decomposition of the density matrix of 
form ()5.2I) where pi and p2 are density matrices such that {xi\p2\xi) = {x2\pi\x2) = 0. 
Therefore Theorem A can be reformulated as saying that (xi|p|x2) = iff such a 
decomposition does exist. 

It's easy to prove the first direction of the equivalence: if 3{pi, p2, pi, P2} such that 
p = pipi + p2p2 and (xi|p2|xi) = (a;2|pi|a;2) = 0, then (xi|p|x2) = 0. To show this, 
first note that for any density matrix p and W{x,x'}, if {x\p\x) = then (x|p|x') = 0, 
where {x\x') = S^y. Since by assumption (xi|p2|xi) = (a;2|pi|x2) = 0, then (xi|p|x2) = 

Ei Pi{xi\pi\x2) = 0. 

The converse can also be proved. We use the facts that any p can always be written as 
the reduced density matrix of an enlarged pure state, where the system of interest (call 
it A) is entangled with an ancilla B, i.e, 



and th at any bipartite pure state can always be written in the Schmidt decomposition 



where and {|0f)} are orthonormal and rji G [0,1]. The superscript B denotes 

the states of the ancilla and the absence of a superscript denotes the states of the 
system of interest, A. We decompose each pure state \ipi) that appears in the Schmidt 
decomposition in the basis of eigenstates of x as \tpi) = J2k^i,k\^k) ■ By assumption 



p = TrB{\'i')AB{'if\AB} 



(A.l) 



EK95| 




(A.2) 
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Proof of Theorem A of Chapter [5] 



{xi\p\x2) = and therefore J2iVi{^i\'^i){i^i\^2) = Si 2 — 0- We can expand 
I'^ab) as 

I^'ab) = \xi)\1b) + |a;2)|2B) + ^ v^Ci,fc|xfc)|0f ), (A.3) 



A,'>2,i 



where we define the (unnormalized) \1b) = y/Vi<^i,i\w) and \2b) = Y.i y/ViCi,2\4>i) ■ 
The inner product of these two vectors is (1b|2_b) = Xli '7«'^«,i'^i 2- But as shown above 
'7iCj,i'^i 2 = 0? so \1b) and |2b) are orthogonal. We can therefore define an orthonor- 
mal basis with the (normalized) |1b) = \Ib)/ \/Y.i "iMi,!? and \2b) = \'^b) / ^/^~ViWi^^ 
plus additional |jB)with 3 < j < D, where D is the dimension of subsystem 5's Hilbert 
space. Taking the trace of pab = |^a_b)(^ab| therefore yields 

p = TtbIpab} 

= (IbIpab + (2b|pab |2b) 

+ 5^0b|Pab|jb). (A.4) 

J>2 



Now referring to expansion (IA.3I) . we see that {Ib\pab\^b) = X]j'7j|ci,iPki)(a^i| and 



{2b\pab\'2b) = J2iVi\ci,2\'^\x2){x2\- We then define pi = pi = J2iVi\ci,i\'^, p2 = 

1- pi and p2 = ^{Y.iVi\ci,2\^\x2){x2\ + Y.j>2{jB\pAB\jB)}- Obviously (x2|pi|x2) = 0, 
and by substituting (IA.3I) into p2 we see that (a;i|p2|a;i) = 0. Therefore p can be 
decomposed as p = pipi + P2P2 with (xi|p2|xi) = (a;2|pi|a;2) = as desired. 



Appendix B 

Proof of Theorem 2 of Chapter 



5 



We wish to prove that if p can be written as 

P = PLPL + PRPR, (B.l) 

then 

> PL^lf^LP^ + PrAIj^j^^ (B.2) 

where 

pB 

The subscript J refers to the pj from which the probabiUties are calculated. 
We have 

ALX = J2p{p^)a\p^\p^) 

pB 

pB pA 

= EE E piPiip^p')ip^-{p''\p')r 
^ EE E 

pB pA I=R,L 

The inequality follows because (p'^lp^) is the mean of P(p^Ip^) for the p of (IbH ). and 
the choice a = J2pPip)P — (p) ^^^^ minimise '^pP{p){p — a)^- From this the desired 
result follows. 
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